Hybrid expression of neisserial proteins

ABSTRACT

Two or more Neisserial proteins (e.g. A and B) are expressed as a single hybrid protein which can be represented simply by the formula NH₂-A-B-COOH.

[0001] All documents cited herein are incorporated by reference in their entirety.

TECHNICAL FIELD

[0002] This invention is in the field of protein expression. In particular, it relates to the heterologous expression of proteins from Neisseria (e.g. N. gonorrhoeae or, preferably, N. meningitidis).

BACKGROUND ART

[0003] International patent applications WO99/24578, WO99/36544, WO99/57280 and WO00/22430 disclose proteins from Neisseria meningitidis and Neisseria gonorrhoeae. These proteins are typically described as being expressed in E. coli (i.e. heterologous expression) as either N-terminal GST-fusions or C-terminal His-tag fusions, although other expression systems, including expression in native Neisseria, are also disclosed.

[0004] It is an object of the present invention to provide alternative and improved approaches for the heterologous expression of these proteins. These approaches will typically affect the level of expression, the ease of purification, the cellular localisation of expression, and/or the immunological properties of the expressed protein.

DISCLOSURE OF THE INVENTION

[0005] In accordance with the invention, two or more (e.g. 3, 4, 5, 6 or more) proteins of the invention are expressed as a single hybrid protein. It is preferred that no non-Neisserial fusion partner (e.g. GST or poly-His) is used.

[0006] This offers two advantages. Firstly, a protein that may be unstable or poorly expressed on its own can be assisted by adding a suitable hybrid partner that overcomes the problem. Secondly, commercial manufacture is simplified—only one expression and purification need be employed in order to produce two separately-useful proteins.

[0007] Thus the invention provides a method for the simultaneous heterologous expression of two or more proteins of the invention, in which said two or more proteins of the invention are fused (i.e. they are translated as a single polypeptide chain).

[0008] The method will typically involve the steps of: obtaining a first nucleic acid encoding a first protein of the invention; obtaining a second nucleic acid encoding a second protein of the invention; ligating the first and second nucleic acids. The resulting nucleic acid may be inserted into an expression vector, or may already be part of an expression vector.
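The ligation step described above can be sketched in Python. This is only an illustrative sketch and not the cloning procedure used in the examples: the function name `fuse_orfs`, the toy sequences and the optional `linker` argument are assumptions introduced here. It joins two coding sequences so that they are translated as a single chain, removing the stop codon of the upstream sequence and checking that the reading frame is preserved.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_orfs(first: str, second: str, linker: str = "") -> str:
    """Return one coding sequence in which `first` reads through into `second`."""
    first, second, linker = first.upper(), second.upper(), linker.upper()
    for name, seq in (("first", first), ("linker", linker), ("second", second)):
        if len(seq) % 3:
            raise ValueError(f"{name} sequence is not a whole number of codons")
    # Remove the stop codon of the upstream ORF so that translation continues.
    if first[-3:] in STOP_CODONS:
        first = first[:-3]
    upstream = first + linker
    # Any remaining in-frame stop would terminate the hybrid prematurely.
    for i in range(0, len(upstream), 3):
        if upstream[i:i + 3] in STOP_CODONS:
            raise ValueError(f"in-frame stop codon at nucleotide {i + 1}")
    return upstream + second


# Toy sequences (hypothetical, for illustration only).
hybrid = fuse_orfs("ATGGCTTAA", "TGCCAATAA", linker="GGATCC")
print(hybrid)  # ATGGCTGGATCCTGCCAATAA
```

The frame check is deliberately strict: a linker or insert whose length is not a multiple of three would shift the downstream protein out of frame, which is the most common way such a fusion fails.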

[0009] Where just two proteins are joined, the hybrid protein can be represented simply by the formula NH₂-A-B-COOH. A and B can each be selected from any Neisserial proteins, and in particular those represented by SEQ#s 1-4326. The method is well suited to the expression of proteins orf1, orf4, orf25, orf40, Orf46/46.1, orf83, 233, 287, 292L, 564, 687, 741, 907, 919, 953, 961 and 983.

[0010] The 42 hybrids indicated by ‘X’ in the following table of form NH₂-A-B-COOH are preferred (rows give A, columns give B; ‘-’ marks the unmarked diagonal):

A \ B      ORF46.1   287   741   919   953   961   983
ORF46.1    -         X     X     X     X     X     X
287        X         -     X     X     X     X     X
741        X         X     -     X     X     X     X
919        X         X     X     -     X     X     X
953        X         X     X     X     -     X     X
961        X         X     X     X     X     -     X
983        X         X     X     X     X     X     -
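The count of 42 follows from taking ordered pairs of two different proteins out of seven (7 × 6 = 42). A short sketch that enumerates them is given here; the list name `PREFERRED` and the function `preferred_hybrids` are illustrative names, not part of the specification.

```python
from itertools import permutations

PREFERRED = ["ORF46.1", "287", "741", "919", "953", "961", "983"]


def preferred_hybrids(proteins=PREFERRED):
    """List every ordered pair A-B with A different from B."""
    return [f"NH2-{a}-{b}-COOH" for a, b in permutations(proteins, 2)]


hybrids = preferred_hybrids()
print(len(hybrids))  # 42
print(hybrids[0])    # NH2-ORF46.1-287-COOH
```

Order matters here because A-B and B-A are different polypeptides, which is why permutations rather than combinations are used.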

[0011] Preferred proteins to be expressed as hybrids are thus ORF46.1, 287, 741, 919, 953, 961 and 983. These may be used in their essentially full-length form, or poly-glycine deletion (ΔG) forms may be used (e.g. ΔG-287, ΔGTbp2, ΔG741, ΔG983 etc.), or truncated forms may be used (e.g. Δ1-287, Δ2-287 etc.), or domain-deleted versions may be used (e.g. 287B, 287C, 287BC, ORF46₁₋₄₃₃, ORF46₄₃₃₋₆₀₈, ORF46, 961c etc.) and so on.

[0012] Particularly preferred are: (a) a hybrid protein comprising 919 and 287; (b) a hybrid protein comprising 953 and 287; (c) a hybrid protein comprising 287 and ORF46.1; (d) a hybrid protein comprising ORF1 and ORF46.1; (e) a hybrid protein comprising 919 and ORF46.1; (f) a hybrid protein comprising ORF46.1 and 919; (g) a hybrid protein comprising ORF46.1, 287 and 919; (h) a hybrid protein comprising 919 and 519; and (i) a hybrid protein comprising ORF97 and 225.

[0013] Further embodiments are shown in the drawings and include ΔG287-919, ΔG287-953, ΔG287-961, ΔG983-ORF46.1, ΔG983-741, ΔG983-961, ΔG983-961C, ΔG741-961, ΔG741-961C, ΔG741-983, ΔG741-ORF46.1, ORF46.1-741, ORF46.1-961, ORF46.1-961C, 961-ORF46.1, 961-741, 961-983, 961C-ORF46.1, 961C-741, 961C-983, 961CL-ORF46.1, 961CL-741, and 961CL-983.

[0014] Where 287 is used, it is preferably at the C-terminal end of a hybrid; if it is to be used at the N-terminus, it is preferred to use a ΔG form of 287 (e.g. as the N-terminus of a hybrid with ORF46.1, 919, 953 or 961).

[0015] Where 287 is used, this is preferably from strain 2996 or from strain 394/98.

[0016] Where 961 is used, this is preferably at the N-terminus. Domain forms of 961 may be used.

[0017] Alignments of polymorphic forms of ORF46, 287, 919 and 953 are disclosed in WO00/66741. Any of these polymorphs can be used according to the present invention.

[0018] Preferably, the constituent proteins (A and B) in a hybrid protein according to the invention will be from the same strain.

[0019] The fused proteins in the hybrid may be joined directly, or may be joined via a linker peptide e.g. via a poly-glycine linker (i.e. Gₙ where n=3, 4, 5, 6, 7, 8, 9, 10 or more) or via a short peptide sequence which facilitates cloning. It is evidently preferred not to join a ΔG protein to the C-terminus of a poly-glycine linker.
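At the protein level, the joining options above can be illustrated with a small sketch. The function `join_proteins`, its parameters and the toy sequences are hypothetical; the warning simply reflects the stated preference not to place a ΔG form directly after a poly-glycine linker, since that would re-create a glycine stretch in front of it.

```python
import warnings


def join_proteins(a: str, b: str, gly_linker: int = 0, b_is_delta_g: bool = False) -> str:
    """Join A and B directly (gly_linker=0) or through a run of glycines."""
    if gly_linker < 0:
        raise ValueError("linker length cannot be negative")
    if gly_linker and b_is_delta_g:
        # Not forbidden, only flagged as non-preferred.
        warnings.warn("a ΔG form after a poly-glycine linker is not preferred")
    return a + "G" * gly_linker + b


print(join_proteins("MKT", "SPD"))                # MKTSPD
print(join_proteins("MKT", "SPD", gly_linker=4))  # MKTGGGGSPD
```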

[0020] The fused proteins may lack native leader peptides or may include the leader peptide sequence of the N-terminal fusion partner.

[0021] Host

[0022] It is preferred to utilise a heterologous host. The heterologous host may be prokaryotic or eukaryotic. It is preferably E. coli, but other suitable hosts include Bacillus subtilis, Vibrio cholerae, Salmonella typhi, Salmonella typhimurium, Neisseria meningitidis, Neisseria gonorrhoeae, Neisseria lactamica, Neisseria cinerea, Mycobacteria (e.g. M. tuberculosis), yeast etc.

[0023] Vectors, Hosts etc.

[0024] As well as the methods described above, the invention provides: (a) nucleic acid and vectors useful in these methods; (b) host cells containing said vectors; (c) proteins expressed or expressible by the methods; (d) compositions comprising these proteins, which may be suitable as vaccines, for instance, or as diagnostic reagents, or as immunogenic compositions; (e) these compositions for use as medicaments (e.g. as vaccines) or as diagnostic reagents; (f) the use of these compositions in the manufacture of (1) a medicament for treating or preventing infection due to Neisserial bacteria, (2) a diagnostic reagent for detecting the presence of Neisserial bacteria or of antibodies raised against Neisserial bacteria, and/or (3) a reagent which can raise antibodies against Neisserial bacteria; and (g) a method of treating a patient, comprising administering to the patient a therapeutically effective amount of these compositions.

[0025] Sequences

[0026] The invention also provides a protein or a nucleic acid having any of the sequences set out in the following examples. It also provides proteins and nucleic acid having sequence identity to these. As described herein, the degree of ‘sequence identity’ is preferably greater than 50% (e.g. 60%, 70%, 80%, 90%, 95%, 99% or more).

[0027] Nomenclature Herein

[0028] The 2166 protein sequences disclosed in WO99/24578, WO99/36544 and WO99/57280 are referred to herein by the following SEQ# numbers:

Application   Protein sequences         SEQ# herein
WO99/24578    Even SEQ IDs 2-892        SEQ#s 1-446
WO99/36544    Even SEQ IDs 2-90         SEQ#s 447-491
WO99/57280    Even SEQ IDs 2-3020       SEQ#s 492-2001
WO99/57280    Even SEQ IDs 3040-3114    SEQ#s 2002-2039
WO99/57280    SEQ IDs 3115-3241         SEQ#s 2040-2166

[0029] In addition to this SEQ# numbering, the naming conventions used in WO99/24578, WO99/36544 and WO99/57280 are also used (e.g. ‘ORF4’, ‘ORF40’, ‘ORF40-1’ etc. as used in WO99/24578 and WO99/36544; ‘m919’, ‘g919’ and ‘a919’ etc. as used in WO99/57280).

[0030] The 2160 proteins NMB0001 to NMB2160 from Tettelin et al. [Science (2000) 287:1809-1815] are referred to herein as SEQ#s 2167-4326 [see also WO00/66791].
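The SEQ# numbering can be expressed as a simple lookup. The sketch below assumes, as the ranges suggest but the text does not state explicitly, that sequences are numbered in ascending order within each range and that NMB0001 to NMB2160 map in order onto SEQ#s 2167-4326. The names `RANGES`, `seq_number` and `nmb_seq_number` are illustrative.

```python
# (application, first SEQ ID, last SEQ ID, step, first SEQ# herein)
RANGES = [
    ("WO99/24578", 2, 892, 2, 1),
    ("WO99/36544", 2, 90, 2, 447),
    ("WO99/57280", 2, 3020, 2, 492),
    ("WO99/57280", 3040, 3114, 2, 2002),
    ("WO99/57280", 3115, 3241, 1, 2040),
]


def seq_number(application: str, seq_id: int) -> int:
    """Map a SEQ ID from one of the earlier applications to a SEQ# herein."""
    for app, first, last, step, base in RANGES:
        in_range = first <= seq_id <= last and (seq_id - first) % step == 0
        if app == application and in_range:
            return base + (seq_id - first) // step
    raise ValueError(f"SEQ ID {seq_id} of {application} has no SEQ# herein")


def nmb_seq_number(nmb: int) -> int:
    """Map NMB0001..NMB2160 to SEQ#s 2167..4326 (assumed to be in order)."""
    if not 1 <= nmb <= 2160:
        raise ValueError("NMB number out of range")
    return 2166 + nmb


print(seq_number("WO99/57280", 3020))  # 2001
print(nmb_seq_number(1870))            # 4036
```

Each range ends exactly where the next begins (446, 491, 2001, 2039, 2166, 4326), which is a useful consistency check on the table.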

[0031] The term ‘protein of the invention’ as used herein refers to a protein comprising:

[0032] (a) one of sequences SEQ#s 1-4326; or

[0033] (b) a sequence having sequence identity to one of SEQ#s 1-4326; or

[0034] (c) a fragment of one of SEQ#s 1-4326.

[0035] The degree of ‘sequence identity’ referred to in (b) is preferably greater than 50% (e.g. 60%, 70%, 80%, 90%, 95%, 99% or more). This includes mutants and allelic variants [e.g. see WO00/66741]. Identity is preferably determined by the Smith-Waterman homology search algorithm as implemented in the MPSRCH program (Oxford Molecular), using an affine gap search with parameters gap open penalty=12 and gap extension penalty=1. Typically, 50% identity or more between two proteins is considered to be an indication of functional equivalence.
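For readers unfamiliar with the approach, the following is a minimal sketch of a Smith-Waterman local alignment with affine gap penalties (gap open 12, gap extension 1). It is not the MPSRCH implementation. Several choices are assumptions made here for illustration: a simple match/mismatch score (+5/-4) stands in for a substitution matrix, a gap of length k costs 12 + (k-1) × 1, and percent identity is taken over the columns of the best local alignment.

```python
def local_alignment_identity(a, b, match=5, mismatch=-4, gap_open=12, gap_ext=1):
    """Affine-gap Smith-Waterman; returns (score, identical, columns)."""
    n, m = len(a), len(b)
    neg = float("-inf")
    H = [[0] * (m + 1) for _ in range(n + 1)]    # best score ending at (i, j)
    E = [[neg] * (m + 1) for _ in range(n + 1)]  # ending with a gap in `a`
    F = [[neg] * (m + 1) for _ in range(n + 1)]  # ending with a gap in `b`
    best, end = 0, (0, 0)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            E[i][j] = max(H[i][j - 1] - gap_open, E[i][j - 1] - gap_ext)
            F[i][j] = max(H[i - 1][j] - gap_open, F[i - 1][j] - gap_ext)
            s = match if a[i - 1] == b[j - 1] else mismatch
            H[i][j] = max(0, H[i - 1][j - 1] + s, E[i][j], F[i][j])
            if H[i][j] > best:
                best, end = H[i][j], (i, j)
    # Trace back from the best cell, counting aligned columns and identities.
    i, j = end
    state, identical, columns = "H", 0, 0
    while i > 0 and j > 0:
        if state == "H":
            if H[i][j] == 0:
                break
            s = match if a[i - 1] == b[j - 1] else mismatch
            if H[i][j] == H[i - 1][j - 1] + s:
                identical += a[i - 1] == b[j - 1]
                columns += 1
                i, j = i - 1, j - 1
            elif H[i][j] == E[i][j]:
                state = "E"
            else:
                state = "F"
        elif state == "E":
            columns += 1
            if E[i][j] != E[i][j - 1] - gap_ext:
                state = "H"  # this column opened the gap
            j -= 1
        else:
            columns += 1
            if F[i][j] != F[i - 1][j] - gap_ext:
                state = "H"  # this column opened the gap
            i -= 1
    return best, identical, columns


def percent_identity(a, b, **scoring):
    """Identity over the columns of the best local alignment, in percent."""
    _, identical, columns = local_alignment_identity(a, b, **scoring)
    return 100.0 * identical / columns if columns else 0.0


print(percent_identity("ACDEFGHIK", "ACDEFGHIK"))            # 100.0
print(round(percent_identity("ACDEFGHIK", "ACDEYGHIK"), 1))  # 88.9
```

Different programs normalise identity differently (for example over the shorter sequence rather than over alignment columns), so values from this sketch are not expected to reproduce MPSRCH output exactly.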

[0036] The ‘fragment’ referred to in (c) should comprise at least n consecutive amino acids from one of SEQ#s 1-4326 and, depending on the particular sequence, n is 7 or more (e.g. 8, 10, 12, 14, 16, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100 or more). Preferably the fragment comprises an epitope from one of SEQ#s 1-4326. Preferred fragments are those disclosed in WO00/71574 and WO01/04316.
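The fragment criterion can be checked mechanically. In this sketch the function name `is_fragment` is hypothetical, and the parent sequence used in the usage lines is simply the first 20 residues of the ORF46 sequence given in Example 1. A candidate qualifies if it contains at least n consecutive residues of the parent, with n being 7 or more.

```python
def is_fragment(candidate: str, parent: str, n: int = 7) -> bool:
    """True if `candidate` contains at least n consecutive residues of `parent`."""
    if n < 7:
        raise ValueError("n is 7 or more")
    # Slide a window of length n along the parent and look for it in the candidate.
    return any(parent[i:i + n] in candidate for i in range(len(parent) - n + 1))


parent = "LGISRKISLILSILAVCLPM"
print(is_fragment("XXISRKISLIXX", parent))  # True
print(is_fragment("XXISRKIXX", parent))     # False
```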

[0037] Preferred proteins of the invention are found in N. meningitidis serogroup B.

[0038] Preferred proteins for use according to the invention are those of serogroup B N. meningitidis strain 2996 or strain 394/98 (a New Zealand strain). Unless otherwise stated, proteins mentioned herein are from N. meningitidis strain 2996. It will be appreciated, however, that the invention is not in general limited by strain. References to a particular protein (e.g. ‘287’, ‘919’ etc.) may be taken to include that protein from any strain.

[0039] It will be appreciated that references to “nucleic acid” include DNA and RNA, and also their analogues, such as those containing modified backbones, and also peptide nucleic acids (PNA) etc.

BRIEF DESCRIPTION OF DRAWINGS

[0040] FIGS. 1 to 26 show hybrid proteins according to the invention.

MODES FOR CARRYING OUT THE INVENTION

EXAMPLE 1 Hybrids of ORF46

[0041] The complete ORF46 protein from N. meningitidis (serogroup B, strain 2996) has the following sequence:

  1 LGISRKISLI LSILAVCLPM HAHASDLAND SFIRQVLDRQ HFEPDGKYHL
 51 FGSRGELAER SGHIGLGKIQ SHQLGNIMIQ QAAIKGNIGY IVRFSDHGHE
101 VHSPFDNHAS HSDSDEAGSP VDGFSLYRIH WDGYEHHPAD GYDGPQGGGY
151 PAPKGARDIY SYDIKGVAQN IRLNLTDNRS TGQRLADRFH NAGSMLTQGV
201 GDGFKRATRY SPELDRSGNA AEAFNGTADI VKNIIGAAGE IVGAGDAVQG
251 ISEGSNIAVM HGLGLLSTEN KMARINDLAD MAQLKDYAAA AIRDWAVQNP
301 NAAQGIEAVS NIFMAAIPIK GIGAVRGKYG LGGITAHPIK RSQMGAIALP
351 KGKSAVSDNF ADAAYAKYPS PYHSPNIRSN LEQRYGKENI TSSTVPPSNG
401 KNVKLADQRH PKTGVPFDGK GFPNFEKHVK YDTKLDIQEL SGGGIPKAXP
451 VSDAKPRWEV DRKLNKLTTR EQVEENVQEI RNGNKNSNFS QHAQLEREIN
501 KLKSADEINF ADGMGKFTDS MNDKAFSELV KSVKENGFTN PVVEYVEING
551 KAYIVRGNNR VFAAEYLGRI HELKFKKVDF PVPNTSWKNP TDVLNESGNV
601 KRPRYRSK*

[0042] The leader peptide is underlined.

[0043] The sequences of ORF46 from other strains can be found in WO00/66741.

[0044] ORF46 has been fused at its C-terminus and N-terminus with 287, 919, and ORF1. The hybrid proteins were generally insoluble, but gave some good ELISA and bactericidal results (against the homologous 2996 strain):

Protein               ELISA    Bactericidal Ab
Orf1-Orf46.1-His      850      256
919-Orf46.1-His       12900    512
919-287-Orf46-His     n.d.     n.d.
Orf46.1-287His        150      8192
Orf46.1-919His        2800     2048
Orf46.1-287-919His    3200     16384

[0045] For comparison, ‘triple’ hybrids of ORF46.1, 287 (either as a GST fusion, or in ΔG287 form) and 919 were constructed and tested against various strains (including the homologous 2996 strain) versus a simple mixture of the three antigens. FCA was used as adjuvant:

                        2996     BZ232   MC58    NGH38   F6124    BZ133
Mixture                 8192     256     512     1024    >2048    >2048
ORF46.1-287-919his      16384    256     4096    8192    8192     8192
ΔG287-919-ORF46.1his    8192     64      4096    8192    8192     16384
ΔG287-ORF46.1-919his    4096     128     256     8192    512      1024

[0046] Again, the hybrids show equivalent or superior immunological activity.
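Because some titres in the table above are reported only as lower bounds (e.g. ‘>2048’), a strain-by-strain comparison needs to treat them as intervals rather than numbers. The sketch below is one hedged way to do that; the function names and the four outcome labels are assumptions made for illustration, and the data are the ‘Mixture’ and ‘ORF46.1-287-919his’ rows of the table.

```python
import math


def titre_interval(text: str):
    """Turn '8192', '>2048' or '<4' into a (low, high) interval."""
    text = text.strip()
    if text.startswith(">"):
        return (int(text[1:]), math.inf)
    if text.startswith("<"):
        return (0, int(text[1:]))
    value = int(text)
    return (value, value)


def compare(hybrid: str, mixture: str) -> str:
    """Classify the hybrid titre relative to the mixture titre."""
    h_lo, h_hi = titre_interval(hybrid)
    m_lo, m_hi = titre_interval(mixture)
    if h_lo == h_hi == m_lo == m_hi:
        return "equal"
    if h_lo >= m_hi:
        return "higher"
    if m_lo >= h_hi:
        return "lower"
    return "indeterminate"  # a bounded value overlaps the other titre


strains = ["2996", "BZ232", "MC58", "NGH38", "F6124", "BZ133"]
mixture = ["8192", "256", "512", "1024", ">2048", ">2048"]
hybrid = ["16384", "256", "4096", "8192", "8192", "8192"]
for strain, h, m in zip(strains, hybrid, mixture):
    print(strain, compare(h, m))
```

For F6124 and BZ133 the result is ‘indeterminate’: the hybrid titre of 8192 exceeds 2048, but the mixture titre is only known to be above 2048, so the two cannot be ranked from the table alone.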

[0047] Hybrids of two proteins (strain 2996) were compared to the individual proteins against various heterologous strains:

                    1000    MC58    F6124 (MenA)
ORF46.1-His         <4      4096    <4
ORF1-His            8       256     128
ORF1-ORF46.1-His    1024    512     1024

[0048] Again, the hybrid shows equivalent or superior immunological activity.

EXAMPLE 2 Hybrids of ΔG287

[0049] The deletion of the (Gly)₆ sequence in 287 was found to have a dramatic effect on protein expression. The protein lacking the N-terminal amino acids up to GGGGGG is called ‘ΔG287’. In strain MC58, its basic sequence (leader peptide underlined) is:     SPDVKS ADTLSKPAAP VVSEKETEAK EDAPQAGSQG QGAPSAQGSQ DMAAVSEENT GNGGAVTADN PKNEDEVAQN DMPQNAAGTD SSTPNHTPDP NMLAGNMENQ ATDAGESSQP ANQPDMANAA DGMQGDDPSA GGQNAGNTAA QGANQAGNNQ AAGSSDPIPA SNPAPANGGS NFGRVDLANG VLIDGPSQNI TLTHCKGDSC SGNNFLDEEV QLKSEFEKLS DADKISNYKK DGKNDKFVGL VADSVQMKGI NQYIIFYKPK PTSPARFRRS ARSRRSLPAE MPLIPVNQAD TLIVDGEAVS LTGHSGNIFA PEGNYRYLTY GAEKLPGGSY ALRVQGEPAK GEMLAGAAVY NGEVLHFHTE NGRPYPTRGR FAAKVDFGSK SVDGIIDSGD DLHMGTQKFK AAIDGNGFKG TWTENGSGDV SGKFYGPAGE EVAGKYSYRP TDAEKGGFGV FAGKKEQD*

[0050] ΔG287, with or without a His-tag (‘ΔG287-His’ and ‘ΔG287K’, respectively), is expressed at very good levels in comparison with ‘287-His’ or untagged ‘287’.

[0051] On the basis of gene variability data, variants of ΔG287-His were expressed in E. coli from a number of MenB strains, in particular from strains 2996, MC58, 1000, and BZ232. The results were also good—each of these gave high ELISA titres and also serum bactericidal titres of >8192. ΔG287K, expressed from pET-24b, gave excellent titres in ELISA and the serum bactericidal assay.

[0052] Deletion of poly-Gly sequences is also applicable to Tbp2 (NMB0460), 741 (NMB1870) and 983 (NMB1969). When cloned in a pET vector and expressed in E. coli without the sequence coding for their leader peptides and without poly-Gly (i.e. as “ΔG forms”), the same effect was seen: expression was good in the clones carrying the deletion of the poly-glycine stretch, and poor or absent if the glycines were present in the expressed protein.
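The ΔG operation, i.e. removing the N-terminal residues up to and including the poly-glycine stretch, can be sketched as follows. The function name `delta_g`, the `min_gly` parameter and the toy N-terminal sequence are hypothetical; only the resulting start ‘SPDVKS…’ corresponds to the ΔG287 sequence shown above. A default run length of six reflects the (Gly)₆ of 287 and would need adjusting for other proteins.

```python
import re


def delta_g(protein: str, min_gly: int = 6) -> str:
    """Drop everything up to and including the first run of >= min_gly glycines."""
    run = re.search("G{%d,}" % min_gly, protein)
    if run is None:
        return protein  # no poly-Gly stretch found; leave unchanged
    return protein[run.end():]


# Hypothetical N-terminus: leader-like residues, (Gly)6, then the mature start.
print(delta_g("MKKLLAACGGGGGGSPDVKSADT"))  # SPDVKSADT
```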

[0053] ΔG287 was fused directly in-frame upstream of 919, 953, 961 (sequences shown below) and ORF46.1: ΔG287-919 ATGGCTAGCCCCGATGTTAAATCGGCGGACACGCTGTCAAAACCGGCCGCTCCTGTTGTTGCTGAAAAAGAGACAGAG GTAAAAGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCGCGCCATCCACACAAGGCAGCCAAGATATGGCGGCA GTTTCGGCAGAAAATACAGGCAATGGCGGTGCGGCAACAACGGACAAACCCAAAAATGAAGACGAGGGACCGCAAAAT GATATGCCGCAAAATTCCGCCGAATCCGCAAATCAAACAGGGAACAACCAACCCGCCGATTCTTCAGATTCCGCCCCC GCGTCAAACCCTGCACCTGCGAATGGCGGTAGCAATTTTGGAAGGGTTGATTTGGCTAATGGCGTTTTGATTGATGGG CCGTCGCAAAATATAACGTTGACCCACTGTAAAGGCGATTCTTGTAATGGTGATAATTTATTGGATGAAGAAGCACCG TCAAAATCAGAATTTGAAAATTTAAATGAGTCTGAACGAATTGAGAAATATAAGAAAGATGGGAAAAGCGATAAATTT ACTAATTTGGTTGCGACAGCAGTTCAAGCTAATGGAACTAACAAATATGTCATCATTTATAAAGACAAGTCCGCTTCA TCTTCATCTGCGCGATTCAGGCGTTCTGCACGGTCGAGGAGGTCGCTTCCTGCCGAGATGCCGCTAATCCCCGTCAAT CAGGCGGATACGCTGATTGTCGATGGGGAAGCGGTCAGCCTGACGGGGCATTCCGGCAATATCTTCGCGCCCGAAGGG AATTACCGGTATCTGACTTACGGGGCGGAAAAATTGCCCGGCGGATCGTATGCCCTCCGTGTGCAAGGCGAACCGGCA AAAGGCGAAATGCTTGCTGGCACGGCCGTGTACAACGGCGAAGTGCTGCATTTTCATACGGAAAACGGCCGTCCGTAC CCGACTAGAGGCAGGTTTGCCGCAAAAGTCGATTTCGGCAGCAAATCTGTGGACGGCATTATCGACAGCGGCGATGAT TTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGATGGAAACGGCTTTAAGGGGACTTGGACGGAAAATGGCGGC GGGGATGTTTCCGGAAGGTTTTACGGCCCGGCCGGCGAGGAAGTGGCGGGAAAATACAGCTATCGCCCGACAGATGCG GAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAAGAGCAGGATGGATCCGGAGGAGGAGGATGCCAAAGCAAGAGC ATCCAAACCTTTCCGCAACCCGACACATCCGTCATCAACGGCCCGGACCGGCCGGTCGGCATCCCCGACCCCGCCGGA ACGACGGTCGGCGGCGGCGGGGCCGTCTATACCGTTGTACCGCACCTGTCCCTGCCCCACTGGGCGGCGCAGGATTTC GCCAAAAGCCTGCAATCCTTCCGCCTCGGCTGCGCCAATTTGAAAAACCGCCAAGGCTGGCAGGATGTGTGCGCCCAA GCCTTTCAAACCCCCGTCCATTCCTTTCAGGCAAAACAGTTTTTTGAACGCTATTTCACGCCGTGGCAGGTTGCAGGC AACGGAAGCCTTGCCGGTACGGTTACCGGCTATTACGAGCCGGTGCTGAAGGGCGACGACAGGCGGACGGCACAAGCC CGCTTCCCGATTTACGGTATTCCCGACGATTTTATCTCCGTCCCCCTGCCTGCCGGTTTGCGGAGCGGAAAAQCCCTT GTCCGCATCAGGCAGACGGGAAAAAACAGCGGCACAATCGACAATACCGGCGGCACACATACCGCCGACCTCTCCCGA 
TTCCCCATCACCGCGCGCACAACGGCAATCAAAGGCAGGTTTGAAGGAAGCCGCTTCCTCCCCTACCACACGCGCAAC CAAATCAACGGCGGCGCGCTTGACGGCAAAGCCCCGATACTCGGTTACGCCGAAGACCCCGTCGAACTTTTTTTTATG CACATCCAAGGCTCGGGCCGTCTGAAAACCCCGTCCGGCAAATACATCCGCATCGGCTATGCCGACAAAAACGAACAT CCCTACGTTTCCATCGGACGCTATATGGCGGACAAAGGCTACCTCAAGCTCGGGCAGACCTCGATGCAGGGCATCAAA GCCTATATGCGGCAAAATCCGCAACGCCTCGCCGAAGTTTTGGGTCAAAACCCCAGCTATATCTTTTTCCGCGAGCTT GCCGGAAGCAGCAATGACGGTCCCGTCGGCGCACTGGGCACGCCGTTGATGGGGGAATATGCCGGCGCAGTCGACCGG CACTACATTACCTTGGGCGCGCCCTTATTTGTCGCCACCGCCCATCCGGTTACCCGCAAAGCCCTCAACCGCCTGATT ATGGCGCAGGATACCGGCAGCGCGATTAAAGGCGCGGTGCGCGTGGATTATTTTTGGGGATACGGCGACGAAGCCGGC GAACTTGCCGGCAAACAGAAAACCACGGGTTACGTCTGGCAGCTCCTACCCAACGGTATGAAGCCCGAATACCGCCCG TAACTCGAG 1 MASPDVKSAD TLSKPAAPVV AEKETEVKBD APQAGSQGQG APSTQGSQDM 51 AAVSAENTGN GGAATTDKPK NEDEGPQNDM PQNSAESANQ TGNNQPADSS 101 DSAPASNPAP ANGGSNFGRV DLANGVLIDG PSQNITLTHC KGDSCNGDNL 151 LDEEAPSKSE FENLNESERI EKYKKDGKSD KFTNLVATAV QANGTNKYVI 201 IYKDKSASSS SARFRRSAPS RRSLPAEMPL IPVNQADTLI VDGEAVSLTG 251 HSGNIFAPEG NYRYLTYGAE KLPGGSYALR VQGEPAKGEM LAGTAVYNGE 301 VLHFRTENGR PYPTRGFPAA KVDFGSKSVD GIIDSGDDLH MGTQKFKAAI 351 DGNGFKGTWT ENGGGDVSGR FYGPAGEEVA GKYSYRPTDA EKGGFGVFAG 401 KKEQDGSGGG GCQSKSIQTF PQPDTSVING PDRPVGIPDP AGTTVGGGGA 451 VYTVVPHLSL PHWAAQDRAK SLQSFRLGCA NLKNRQGWQD VCAQAFQTPV 501 HSFQAKQFFE RYFTPWQVAG NGSLAGTVTG YYEPVLKGDD RRTAQARPPI 551 YGIPDDFISV PLPAGLRSGK ALVRIRQTGK NSGTIDNTGG THTADLSRFP 601 ITARTTAIKG RFEGSRFLPY HTRNQINGGA LDGKAPILGY AEDPVELFFM 651 HIQGSGRLKT PSGKYIRIGY ADKNEHPYVS IGRYHADKGY LKLGQTSMQG 701 IKAYMRQNPQ RLAEVLGQNP SYIFFRELAG SSNDGPVGAL GTPLMGEYAG 751 AVDRHYITLG APLFVATMIP VTEXALNRLI MAQDTGSAIK GAVRVDYFWG 801 YGDEAGELAG KQKTTGYVWQ LLPNGMKPEY RP* AG287-953 ATGGCTAGCCCCGATGTTAAATCGGCGGACACGCTGTCAAAACCGGCCGCTCCTGTTGTTGCTGAAAAAGAGACAGAG GTAAAAGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCGCGCCATCCACACAAGGCAGCCAAGATATGGCGGCA GTTTCGGCAGAAAATACAGGCAATGGCGGTGCGGCAACAACGGACAAACCCAAAAATGAAGACGAGGGACCGCAAAAT 
GATATGCCGCAAAATTCCGCCGAATCCGCAAATCAAACAGGGAACAACCAACCCGCCGATTCTTCAGATTCCGCCCCC GCGTCAAACCCTGCACCTGCGAATGGCGGTAGCAATTTTGGAAGGGTTGATTTGGCTAATGGCGTTTTGATTGATGGG CCGTCGCAAAATATAACGTTGACCCACTGTAAAGGCGATTCTTGTAATGGTGATAATTTATTGGATGAAGAAGCACCG TCAAAATCAGAATTTGAAAATTTAAATGAGTCTGAACGAATTGAGAAATATAAGAAAGATGGGAAAAGCGATAAATTT ACTAATTTGGTTGCGACAGCAGTTCAAGCTAATGGAACTAACAAATATGTCATCATTTATAAAGACAAGTCCGCTTCA TCTTCATCTGCGCGATTCAGGCGTTCTGCACGGTCGAGGAGGTCGCTTCCTGCCGAGATGCCGCTAATCCCCGTCAAT CAGGCGGATACGCTGATTGTCGATGGGGAAGCGGTCAGCCTGACGGGGCATTCCGGCAATATCTTCGCGCCCGAAGGG AATTACCGGTATCTGACTTACGGGGCGGAAAAATTGCCCGGCGGATCGTATGCCCTCCGTGTGCAAGGCGAACCGGCA AAAGGCGAAATGCTTGCTGGCACGGCCGTGTACAACGGCGAAGTGCTGCATTTTCATACGGAAAACGGCCGTCCGTAC CCGACTAGAGGCAGGTTTGCCGCAAAAGTCGATTTCGGCAGCAAATCTGTGGACGGCATGATCGACAGCGGCGATGAT TTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGATGGAAACGGCTTTAAGGGGACTTGGACGGAAAATGGCGGC GGGGATGTTTCCGGAAGGTTTTTACGGCCCGGCCGGCGAGGAAGTGGCGGAAAATACAGCTATCGCCCGACAGATGCG GAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAAGAGCAGGATGGATCCGGAGGAGGAGGAGCCACCTACAAAGTG GACGAATATCACGCCAACGCCCGTTTCGCCATCGACCATTTCAACACCAGCACCAACGTCGGCGGTTTTTACGGTCTG ACCGGTTCCGTCGAGTTCGACCAAGCAAAACGCGACGGTAAAATCGACATCACCATCCCCGTTGCCAACCTGCAAAGC GGTTCGCAACACTTTACCGACCACCTGAAATCAGCCGACATCTTCGATGCCGCCCAATATCCGGACATCCGCTTTGTT TCCACCAAATTCAACTTCAACGGCAAAAAACTGGTTTCCGTTGACGGCAACCTGACCATGCACGGCAAAACCGCCCCC GTCAAACTCAAAGCCGAAAAATTCAACTGCTACCAAAGCCCGATGGCGAAAACCGAAGTTTGCGGCGGCGACTTCAGC ACCACCATCGACCGCACCAAATGGGGCGTGGACTACCTCGTTAACGTTGGTATGACCAAAAGCGTCCGCATCGACATC CAAATCGAGGCAGCCAAACAATAACTCGAG 1 MASPDVKSAD TLSKPAAPVV AEKETBVKED APQAGSQGQG APSTQGSQDM 51 AAVSAENTGN GGAATTDKPK NEDEGPQNDM PWNSAESANQ TGNNQPADSS 101 DSAPASNPAP ANGGSNFGRV DLANGVLIDG PSQNITLTHC KGDSCNGDNL 151 LDEEAPSKSE FENLNESERI KKYKKDGKSD KFTNLVATAV QANGTNKYVI 201 IYKDKSASSS SARFRRSARS RRSLPAEMPL IPVNQADTLI VDGEAVSLTG 251 HSGNIFAPEG NYRYLTYGAB KLPGGSYALR VQGEPAKGEM LAGTAVYNGE 301 VLHFHTENGR PYPTRGRFAA KVDFGSKSVD GIIDSGDDLH MGTQKFKAAI 351 DGNGFXGTWT ENGGGDVSGR FYGPAGEBVA GKYSYRPTDA 
EKGGFGVFAG 401 KKEQDGSGGG GATYKVDEYH ANARFAIDHF NTSTNVGGFY GLTGSVEFDQ 451 AXRDGKIDIT IPVANLQSGS QHFTDHLKSA DIFDAAQYPD IRFVSTKFNF 501 NGKXLVSVDG NLTHHGKTAP VKLKAEKFNC YQSPMAKTEV CGGDFSTTID 551 RTKWGVDYLV NVGMTKSVRI DIQIEAAKQ* AG287-961 ATGGCTAGCCCCGATGTTAAATCGGCGGACACGCTGTCAAAACCGGCCGCTCCTGTTGTTGCTGAAAAAGAGACAGAG GTAAAAGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCGCGCCATCCACACAAGGCAGCCAAGATATGGCGGCA GTTTCGGCAGAAAATACAGGCAATGGCGGTGCGGCAACAACGGACAAACCCAAAAATGAAGACGAGGGACCGCAAAAT GATATGCCGCAAAATTCCGCCGAATCCGCAAATCAAACAGGGAACAACCAACCCGCCGATTCTTCAGATTCCGCCCCC GCGTCAAACCCTGCACCTGCGAATGGCGGTAGCAATTTTGGAAGGGTTGATTTGGCTAATGGCGTTTTGATTGATGGG CCGTCGCAAAATATAACGTTGACCCACTGTAAAGGCGATTCTTGTAATGGTGATAATTTATTGGATGAAGAAGCACCG TCAAAATCAGAATTTGAAAATTTAAATGAGTCTGAACGAATTGAGAAATATAAGAAAGATGGGAAAAGCGATAAATTT ACTAATTTGGTTGCGACAGCAGTTCAAGCTAATGGAACTAACAAATATGTCATCATTTATAAAGACAAGTCCGCTTCA TCTTCATCTGCGCGATTCAGGCGTTCTGCACGGTCGAGGAGGTCGCTTCCTGCCGAGATGCCGCTAATCCCCGTCAAT CAGGCGGATACGCTGATTGTCGATGGGGAAGCGGTCAGCCTGACGGGGCATTCCGGCAATATCTTCGCGCCCGAAGGG AATTACCGGTATCTGACTTACGGGGCGGAAAAATTGCCCGGCGGATCGTATGCCCTCCGTGTGCAAGGCGAACCGGCA AAAGGCGAAATGCTTGCTGGCACGGCCGTGTACAACGGCGAAGTGCTGCATTTTCATACGGAAAACGGCCGTCCGTAC CCGACTAGAGGCAGGTTTGCCGCAAAAGTCGATTTCGGCAGCAAATCTGTGGACGGCATTATCGACAGCGGCGATGAT TTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGATGGAAACGGCTTTAAGGGGACTTGGACGGAAAATGGCGGC GGGGATGTTTCCGGAAGGTTTTACGGCCCGGCCGGCGAGGAAGTGGCGGGAAAATACAGCTATCGCCCGACAGATGCG GAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAAGAGCAGGATGGATCCGGAGGAGGAGGAGCCACAAACGACGAC GATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTCAAAGCTGGA GAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAAGCCGACGAC TTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAACGTCGATGCC AAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTAGCAGATACT GATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAAGAGACTAAG ACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCATTCAACGAT 
ATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAACAGACGGCC GAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCCGCTGGC ACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATCGCTACGAAC AAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTTGTCAGAATT GATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGATCACGATACT CGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAAGCCGCGCTC TCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGCTACAAATCCGAATCGGCA GTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCAGTCGGCACTTCGTCCGGT TCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGTAACTCGAG 1 HASPDVKSAD TLSKPAAPVV AEXETEVKED APQAGSQGQG APSTQGSQDM 51 AAVSAENTGN GGAATTDKPK NEDEGPQNDM PQNSABSANQ TGNNQPADSS 101 DSAPASNPAP ANGGSNFGRV DLANGVLIDG PSQNITLTHC KGDSCNGDNL 151 LDEEAPSKSE FENLNESERI EKYKKDGKSD KFTNLVATAV QANGTNKYVI 201 IYKDKSASSS SARFRRSARS RRSLPAENPL IPVNQADTLI VDGEAVSLTG 251 HSGNIFAPEG NYRYLTYGAB KLPGGSYALR VQGEPAKGEM LAGTAVYNGE 301 VLHFHTENGR PYPTRGRFAA KVDFGSKSVD GIIDSGDDLH MGTQKFKAAI 351 DGNGFKGTWT ENGGGDVSGR PYGPAGEEVA GKYSYRPTDA EKGGFGVFAG 401 KKEQDGSGGG GATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE 451 DGTITKKDAT AADVEADDFK GLGLKKVVTN LTKTVNENKQ NVDAKVKAAE 501 SEIEKLTTKL ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV 551 KIDEKLEAVA DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE 601 TKQNVDAKVK AAETAAGKAE AAAGTANTAA DKABAVAAKV TDIKADIATN 651 KDNIAKKANS ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH 701 DTRLNGLDKT VSDLRXETRQ GLAEQAALSG LFQPYNVGRF NVTAAVGGYK 751 SESAVAIGTG FRFTENFAAK AGVAVGTSSG SSAAYHVGVN YEW*

[0054]

                 ELISA     Bactericidal
ΔG287-953-His    3834      65536
ΔG287-961-His    108627    65536

[0055] The bactericidal efficacy (homologous strain) of antibodies raised against the hybrid proteins was compared with antibodies raised against simple mixtures of the component antigens (using 287-GST) for 919 and ORF46.1:

           Mixture with 287    Hybrid with ΔG287
919        32000               128000
ORF46.1    128                 16000

[0056] Data for bactericidal activity against heterologous MenB strains and against serogroups A and C were also obtained (‘-’ indicates that no value was given):

                919        919       ORF46.1    ORF46.1
Strain          Mixture    Hybrid    Mixture    Hybrid
NGH38           1024       32000     -          16384
MC58            512        8192      -          512
BZ232           512        512       -          -
MenA (F6124)    512        32000     -          8192
MenC (C11)      >2048      >2048     -          -
MenC (BZ133)    >4096      64000     -          8192

[0057] The hybrid proteins with ΔG287 at the N-terminus are therefore immunologically superior to simple mixtures, with ΔG287-ORF46.1 being particularly effective, even against heterologous strains. ΔG287-ORF46.1K may be expressed in pET-24b.
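The size of the improvement against the homologous strain can be made explicit with a little arithmetic on the titres reported in paragraph [0055]. The sketch below (variable and function names are illustrative) expresses it both as a fold change and as the equivalent number of two-fold dilution steps.

```python
import math

# (mixture with 287, hybrid with ΔG287), homologous strain, from [0055]
TITRES = {"919": (32000, 128000), "ORF46.1": (128, 16000)}


def fold_change(mixture: float, hybrid: float) -> float:
    """Ratio of the hybrid titre to the mixture titre."""
    return hybrid / mixture


for antigen, (mixture, hybrid) in TITRES.items():
    fold = fold_change(mixture, hybrid)
    steps = math.log2(fold)  # number of two-fold dilution steps
    print(f"{antigen}: {fold:g}-fold ({steps:.1f} two-fold dilutions)")
```

On these figures the 919 hybrid gives a 4-fold higher titre and the ORF46.1 hybrid a 125-fold higher titre than the corresponding mixture.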

[0058] The same hybrid proteins were made using New Zealand strain 394/98 rather than 2996: ΔG287NZ-919 ATGGCTAGCCCCGATGTCAAGTCGGCGGACACGCTGTCAAAACCTGCCGCCCCTGTTGTTTCTGAAAAAGAGACAGAG GCAAAGGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCGCGCCATCCGCACAAGGCGGTCAAGATATGGCGGCG GTTTCGGAAGAAAATACAGGCAATGGCGGTGCGGCAGCAACGGACAAACCCAAAAATGAAGACGAGGGGGCGCAAAAT GATATGCCGCAAAATGCCGCCGATACAGATAGTTTGACACCGAATCACACCCCGGCTTCGAATATGCCGGCCGGAAAT ATGGAAAACCAAGCACCGGATGCCGGGGAATCGGAGCAGCCGGCAAACCAACCGGATATGGCAAATACGGCGGACGGA ATGCAGGGTGACGATCCGTCGGCAGGCGGGGAAAATGCCGGCAATACGGCTGCCCAAGGTACAAATCAAGCCGAAAAC AATCAAACCGCCGGTTCTCAAAATCCTGCCTCTTCAACCAATCCTAGCGCCACGAATAGCGGTGGTGATTTTGGAAGG ACGAACGTGGGCAATTCTGTTGTGATTGACGGGCCGTCGCAAAATATAACGTTGACCCACTGTAAAGGCGATTCTTGT AGTGGCAATAATTTCTTGGATGAAGAAGTACAGCTAAAATCAGAATTTGAAAAATTAAGTGATGCAGACAAAATAAGT AATTACAAGAAAGATGGGAAGAATGACGGGAAGAATGATAAATTTGTCGGTTTGGTTGCCGATAGTGTGCAGATGAAG GGAATCAATCAATATATTATCTTTTATAAACCTAAACCCACTTCATTTGCGCGATTTAGGCGTTCTGCACGGTCGAGG CGGTCGCTTCCGGCCGAGATGCCGCTGATTCCCGTCAATCAGGCGGATACGCTGATTGTCGATGGGGAAGCGGTCAGC CTGACGGGGCATTCCGGCAATATCTTCGCGCCCGAAGGGAATTACCGGTATCTGACTTACGGGGCGGAAAAATTGCCC GGCGGATCGTATGCCCTCCGTGTTCAAGGCGAACCTTCAAAAGGCGAAATGCTCGCGGGCACGGCAGTGTACAACGGC GAAGTGCTGCATTTTCATACGGAAAACGGCCGTCCGTCCCCGTCCAGAGGCAGGTTTGCCGCAAAAGTCGATTTCGGC AGCAAATCTGTGGACGGCATTATCGACAGCGGCGATGGTTTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGAT GGAAACGGCTTTAAGGGGACTTGGACGGAAAATGGCGGCGGGGATGTTTCCGGAAAGTTTTACGGCCCGGCCGGCGAG GAAGTGGCGGGAAAATACAGCTATCGCCCAACAGATGCGGAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAGGAG CAGGATGGATCCGGAGGAGGAGGATGCCAAAGCAAGAGCATGCAAACCTTTCCGCAACCCGACACATCCGTCATCAAC GGCCCGGACCGGCCGGTCGGCATCCCCGACCCCGCCGGAACGACGGTCGGCGGCGGCGGGGCCGTCTATACCGTTGTA CCGCACCTGTCCCTGCCCCACTGGGCGGCGCAGGATTTCGCCAAAAGCCTGCAATCCTTCCGCCTCGGCTGCGCCAAT TTGAAAAACCGCCAAGGCTGGCAGGATGTGTGCGCCCAAGCCTTTCAAACCCCCGTCCATTCCTTTCAGGCAAAACAG TTTTTTGAACGCTATTTCACGCCGTGGCAGGTTGCAGGCAACGGAAGCCTTGCCGGTACGGTTACCGGCTATTACGAG CCGGTGCTGAAGGGCGACGACAGGCGGACGGCACAAGCCCGCTTCCCGATTTACGGTATTCCCGACGATTTTATCTCC 
GTCCCCCTGCCTGCCGGTTTGCGGAGCGGAAAAGCCCTTGTCCGCATCAGGCAGACGGGAAAAAACAGCGGCACAATC GACAATACCGGCGGCACACATACCGCCGACCTCTCCCGATTCCCCATCACCGCGCGCACAACGGCAATCAAAGGGAGG TTTGAAGGAAGCCGCTTCCTCCCCTACCACACGCGCAACCAAATCAACGGCGGCGCGCTTGACGGCAAAGCCCCGATA CTCGGTTACGCCGAAGACCCCGTCGAACTTTTTTTTATGCACATCCAAGGCTCGGGCCGTCTGAAAACCCCGTCCGGC AAATACATCCGCATCGGCTATGCCGACAAAAACGAACATCCCTACGTTTCCATCGGACGCTATATGGCGGACAAAGGC TACCTCAAGCTCGGGCAGACCTCGATGCAGGGCATCAAAGCCTATATGCGGCAAAATCCGCAACGCCTCGCCGAAGTT TTGGGTCAAAACCCCAGCTATATCTTTTTCCGCGAGCTTGCCGGAAGCAGCAATGACGGTCCCGTCGGCGCACTGGGC ACGCCGTTGATGGGGGAATATGCCGGCGCAGTCGACCGGCACTACATTACCTTGGGCGCGCCCTTATTTGTCGCCACC GCCCATCCGGTTACCCGCAAAGCCCTCAACCGCCTGATTATGGCGCAGGATACCGGCAGCGCGATTAAAGGCGCGGTG CGCGTGGATTATTTTTGGGGATACGGCGACGAAGCCGGCGAACTTGCCGGCAAACAGAAAACCACGGGTTACGTCTGG CAGCTCCTACCCAACGGTATGAAGCCCGAATACCGCCCGTAAAAGCTT 1 HASPDVKSAD TLSKPAAPVV SEKETEAXED APQAGSQGQG APSAQGGQDM 51 AAVSEENTGN GGAAATDKPK NEDEGAQNDM PQNAADTDSL TPNHTPASNM 101 PAGNMENQAP DAGESEQPAN QPDMANTADG MQGDDPSAGG ENAGNTAAQG 151 TNQAENNQTA GSQNPASSTN PSATNSGGDF GRTNVGNSVV IDGPSQNITL 201 THCKGDSCSG NNFLDEEVQL KSEFEKLSDA DKISNYKKDG KNDGKNDKFV 251 GLVADSVQMK GINQYIIFYK PKPTSFARFR RSARSRRSLP AENPLIPVNQ 301 ADTLIVDGEA VSLTGHSGNI FAPEGNYRYL TYGAEKLPGG SYALRVQGEP 351 SKGEMLAGTA VYNGEVLHFH TENGRPSPSR GRFAAKVDFG SKSVDGIIDS 401 GDGLHHGTQK FKAAIDGNGF KGTWTENGGG DVSGKFYGPA GEEVAGKYSY 451 RPTDAEKGGF GVFAGKKEQD GSGGGGCQSK SIQTFPQPDT SVINGPDRPV 501 GIPDPAGTTV GGGGAVYTVV PHLSLPHWAA QDFAKSLQSF RLGCANLKNR 553 QGWQDVCAQA FQTPVHSFQA KQFFERYFTP WQVAGNGSLA GTVTGYYEPV 601 LKGDDRRTAQ ARFPIYGIPD DFISVPLPAG LRSGKALVRI RQTGKNGSTI 651 DNTGGTHTAD LSRFPITART TAIKGRFEGS RPLPYHTRNQ INGGALDGKA 701 PILGYAEDPV ELFFMHIQGS GRLKTPSGKY IRIGYADKNB HPYVSIGRYM 751 ADKGYLKLGQ TSMQGIKAYM RQNPQRLAEV LGQNPSYIFF RELAGSSNDG 801 PVGALGTPLM GEYAGAVDRH YITLGAPLFV ATAHPVTEKA LNRLIMAQDT 851 GSAIKGAVRV DYFWGYGDEA GELAGKQKTT GYVWQLLPNG MKPEYRP* ΔG287NZ-953 ATGGCTAGCCCCGATGTCAAGTCGGCGGACACGCTGTCAAAACCTGCCGCCCCTGTTGTTTCTGAAAAAGAGACAGAG 
GCAAAGGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCGCGCCATCCGCACAAGGCGGTCAAGATATGGCGGCG GTTTCGGAAGAAAATACAGGCAATGGCGGTGCGGCAGCAACGGACAAACCCAAAAATGAAGACGAGGGGGCGCAAAAT GATATGCCGCAAAATGCCGCCGATACAGATAGTTTGACACCGAATCACACCCCGGCTTCGAATATGCCGGCCGGAAAT ATGGAAAACCAAGCACCGGATGCCGGGGAATCGGAGCAGCCGGCAAACCAACCGGATATGGCAAATACGGCGGACGGA ATGCAGGGTGACGATCCGTCGGCAGGCGGGGAAAATGCCGGCAATACGGCTGCCCAAGGTACAAATCAAGCCGAAAAC AATTACAAGAAAGATGGGAAGAATGACGGGAAGAATGATAAATTTGTCGGTTTGGTTGCCGATAGTGTGCAGATGAAG ACGAACGTGGGCAATTCTGTTGTGATTGACGGGCCGTCGCAAAATATAACGTTGACCCACTGTAAAGGCGATTCTTGT AGTGGCAATAATTTCTTGGATGAAGAAGTACAGCTAAAATCAGAATTTGAAAAATTAAGTGATGCAGACAAAATAAGT AATTACAAGAAAGATGGGAAGAATGACGGGAAGAATGATAAATTTGTCGGTTTGGTTGCCGATAGTGTGCAGATGAAG GGAATCAATCAATATATTATCTTTTATAAACCTAAACCCACTTCATTTGCGCGATTTAGGCGTTCTGCACGGTCGAGG CGGTCGCTTCCGGCCGAGATGCCGCTGATTCCCGTCAATCAGGCGGATACGCTGATTGTCGATGGGGAAGCGGTCAGC CTGACGGGGCATTCCGGCAATATCTTCGCGCCCGAAGGGAATTACCGGTATCTGACTTACGGGGCGGAAAAATTGCCC GGCGGATCGTATGCCCTCCGTGTTCAAGGCGAACCTTCAAAAGGCGAAATGCTCGCGGGCACGGCAGTGTACAACGGC GAAGTGCTGCATTTTCATACGGAAAACGGCCGTCCGTCCCCGTCCAGAGGCAGGTTTGCCGCAAAAGTCGATTTCGGC AGCAAATCTGTGGACGGCATTATCGACAGCGGCGATGGTTTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGAT GGAAACGGCTTTAAGGGGACTTGGACGGAAAATGGCGGCGGGGATGTTTCCGGAAAGTTTTACGGCCCGGCCGGCGAG GAAGTGGCGGGAAAATACAGCTATCGCCCAACAGATGCGGAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAAGAG CAGGATGGATCCGGAGGAGGAGGAGCCACCTACAAAGTGGACGAATATCACGCCAACGCCCGTTTCGCCATCGACCAT TTCAACACCAGCACCAACGTCGGCGGTTTTTACGGTCTGACCGGTTCCGTCGAGTTCGACCAAGCAAAACGCGACGGT AAAATCGACATCACCATCCCCGTTGCCAACCTGCAAAGCGGTTCGCAACACTTTACCGACCACCTGAAATCAGCCGAC ATCTTCGATGCCGCCCAATATCCGGACATCCGCTTTGTTTCCACCAAATTCAACTTCAACGGCAAAAAACTGGTTTCC GTTGACGGCAACCTGACCATGCACGGCAAAACCGCCCCCGTCAAACTCAAAGCCGAAAAATTCAACTGCTACCAAAGC CCGATGGCGAAAACCGAAGTTTGCGGCGGCGACTTCAGCACCACCATCGACCGCACCAAATGGGGCGTGGACTACCTC GTTAACGTTGGTATGACCAAAAGCGTCCGCATCGACATCCAAATCGAGGCAGCCAAACAATAAAAGCTT 1 NASPDVKSAD TLSKPAAPVV SEKETEAKED APQAGSQGQG APSAQGGQDM 51 AAVSEBNTGN GGAAATDKPK NEDEGAQNDM PQNAADTDSL 
TPNHTPASNM 101 PAGNMENQAP DAGESEQPAN QPDMANTADG MQGDDPSAGG ENAGNTAAQG 151 TNQAENNQTA GSQNPASSTN PSATNSGGDF GRTNVGNSVV IDGPSQNITL 201 THCKGDSCSG NNFLDEEVQL KSEPEKLSDA DKISNYKKDG KNDGKNDKFV 251 GLVADSVQMK GINQYIIFYK PKPTSFAPFR RSARSRRSLP AEMPLIPVNQ 301 ADTLIVDGEA VSLTGHSGNI FAPEGNYRYL TYGAEKLPGG SYALRVQGEP 351 SKGEMLAGTA VYNGEVUIFH TENGRPSPSR GRFAAKVDFG SKSVDGIIDS 401 GDGLHMGTQK FKAAIDGNGF KGTWTENGGG DVSGKFYGPA GEEVAGKYSY 451 RPTDAEKGGF GVFAGKKEQD GSGGGGATYK VDEYHANARF AIDHFNTSTN 501 VGGFYGLTGS VEFDQAKRDG KIDITIPVAN LQSGSQHFTD HLKSADIFDA 551 AQYPDIRFVS TKFNFNGKKL VSVDGNLTMH GKTAPVKLKA EKFNCYQSPM 601 AKTEVCGGDF STTIDRTKWG VDYLVNVGMT KSVRIDIQIE AAKQ* ΔG287NZ-961 ATGGCTAGCCCCGATGTCAAGTCGGCGGACACGCTGTCAAAACCTGCCGCCCCTGTTGTTTCTGAAAAAGAGACAGAG GCAAAGGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCGCGCCATCCGCACAAGGCGGTCAAGATATGGCGGCG GTTTCGGAAGAAAATACAGGCAATGGCGGTGCGGCAGCAACGGACAAACCCAAAAATGAAGACGAGGGGGCGCAAAAT GATATGCCGCAAAATGCCGCCGATACAGATAGTTTGACACCGAATCACACCCCGGCTTCGAATATGCCGGCCGGAAAT ATGGAAAACCAAGCACCGGATGCCGGGGAATCGGAGCAGCCGGCAAACCAACCGGATATGGCAAATACGGCGGACGGA ATGCAGGGTGACGATCCGTCGGCAGGCGGGGAAAATGCCGGCAATACGGCTGCCCAAGGTACAAATCAAGCCGAAAAC AATCAAACCGCCGGTTCTCAAAATCCTGCCTCTTCAACCAATCCTAGCGCCACGAATAGCGGTGGTGATTTTGGAAGG ACGAACGTGGGCAATTCTGTTGTGATTGACGGGCCGTCGCAAAATATAACGTTGACCCACTGTAAAGGCGATTCTTGT AGTGGCAATAATTTCTTGGATGAAGAAGTACAGCTAAAATCAGAATTTGAAAAATTAAGTGATGCAGACAAAATAAGT AATTACAAGAAAGATGGGAAGAATGACGGGAAGAATGATAAATTTGTCGGTTTGGTTGCCGATAGTGTGCAGATGAAG GGAATCAATCAATATATTATCTTTTATAAACCTAAACCCACTTCATTTGCGCGATTTAGGCGTTCTGCACGGTCGAGG CGGTCGCTTCCGGCCGAGATGCCGCTGATTCCCGTCAATCAGGCGGATACGCTGATTGTCGATGGGGAAGCGGTCAGC CTGACGGGGCATTCCGGCAATATCTTCGCGCCCGAAGGGAATTACCGGTATCTGACTTACGGGGCGGAAAAATTGCCC GGCGGATCGTATGCCCTCCGTGTTCAAGGCGAACCTTCAAAAGGCGAAATGCTCGCGGGCACGGCAGTGTACAACGGC GAAGTGCTGCATTTTCATACGGAAAACGGCCGTCCGTCCCCGTCCAGAGGCAGGTTTGCCGCAAAAGTCGATTTCGGC AGCAAATCTGTGGACGGCATTATCGACAGCGGCGATGGTTTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGAT 
GGAAACGGCTTTAAGGGGACTTGGACGGAAAATGGCGGCGGGGATGTTTCCGGAAAGTTTTACGGCCCGGCCGGCGAG GAAGTGGCGGGAAAATACAGCTATCGCCCAACAGATGCGGAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAAGAG CAGGATGGATCCGGAGGAGGAGGAGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCC TACAACAATGGCCAAGAAATCAACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACC AAAAAAGACGCAACTGCAGCCGATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTG ACCAAAACCGTCAATGAAAACAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACA ACCAAGTTAGCAGACACTGATGCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAA TTGGGAGAAAATATAACGACATTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTG GCTGATACCGTCGACAAGCATGCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGAC GAAGCCGTCAAAACCGCCAATGAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCT GCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCT GCAAAAGTTACCGACATCAAAGCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTG TACACCAGAGAAGAGTCTGACAGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGC TTGGCTTCTGCTGAAAAATCCATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGC AAAGAAACCCGCCAAGGCCTTGCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAAT GTAACGGCTGCAGTCGGCGGCTACAAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTT GCCGCCAAAGCAGGCGTGGCAGTCGGCACTTCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGG TAAAAGCTT 1 MASPDVKSAD TLSKPAAPVV SEKETEAKED APQAGSQGQG APSAQGGQDM 51 AAVSEENTGN GGAAATDKPK NEDEGAQNDM PQNAADTDSL TPNHTPASNM 101 PAGNMENQAP DAGESEQPAN QPDMANTADG MQGDDPSAGG ENAGNTAAQG 151 TNQAENNQTA GSQNPASSTN PSATNSGGDF GRTNVGNSVV IDGPSQNITL 201 THCKGDSCSG NNFLDEEVQL KSEFEKLSDA DKISNYKKDG KNDGKNDKFV 251 GLVADSVQMK GINQYIIFYK PKPTSFARFR RSARSRRSLP AEMPLIPVNQ 301 ADTLIVDGEA VSLTGHSGNI FAPEGNYRYL TYGAEKLPGG SYALRVQGEP 351 SKGEMLAGTA VYNGEVLHFH TENGRPSPSR GRFAAKVDFG SKSVDGIIDS 401 GDGLHMGTQK FKAAIDGNGF KGTWTENGGG DVSGKFYGPA GEEVAGKYSY 451 RPTDAEKGGF GVFAGKKEQD GSGGGGATND DDVKKAATVA IAAAYNNGQE 501 INGFKAGETI YDIDEDGTIT KKDATAADVE ADDFKGLGLK KVVTNLTKTV 
551 NENKQNVDAK VKAAESEIEK LTTKLADTDA ALADTDAALD ATTNALNKLG 601 ENITTFAEET KTNIVKIDEK LEAVADTVDK HAEAFNDIAD SLDETNTKAD 651 EAVKTANEAK QTAEETKQNV DAKVKAAETA AGKAEAAAGT ANTAADKAEA 701 VAAKVTDIKA DIATNKDNIA KKANSADVYT REESDSKFVR IDGLNATTEK 751 LDTRLASAEK SIADHDTRLN GLDKTVSDLR KETRQGLAEQ AALSGLFQPY 801 NVGRFNVTAA VGGYKSESAV AIGTGFRFTE NFAAKAGVAV GTSSGSSAAY 851 HVGVNYEW*

EXAMPLE 3 Hybrids of ΔG983

[0059] Protein 983 has the following sequence:

 ΔG983 1 MRTTPTFPTK TFKPTAMALA VATTLSACLG GGGGGTSAPD FNAGGTGIGS 51 NSRATTAKSA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPPN 101 LHTGDFPNPN DAYKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE 151 LYGRKEHGYN ENYKNYTAYM RKEAPEDGGG KDIEASFDDE AVIETEAKPT 201 DIRHVKEIGH IDLVSHIIGG RSVDGRPAGG IAPDATLHIM NTNDETKNEM 251 MVAAIRNAWV KLGERGVRIV NNSFGTTSRA GTADLFQIAN SEEQYRQALL 301 DYSGGDKTDE GIRLMQQSDY GNLSYHIRNK NMLFIFSTGN DAQAQPNTYA 351 LLPFYEKDAQ KGIITVAGVD RSGEKFKREM YGEPGTEPLE YGSNHCGITA 401 MWCLSAPYEA SVRFTRTNPI QIAGTSFSAP IVTGTAALLL QKYPWMSNDN 451 LRTTLLTTAQ DIGAVGVDSK FGWGLLDAGK AMNGPASFPF GDFTADTKGT 501 SDIAYSFRND ISGTGGLIKK GGSQLQLHGN NTYTGKTIIE GGSLVLYGNN 551 KSDMRVETKG ALIYNGAASG GSLNSDGIVY LADTDQSGAN ETVHIKGSLQ 601 LDGKGTLYTR LGKLLKVDGT AIIGGKLYMS ARGKGAGYLN STGRRVPFLS 651 AAKIGQDYSF FTNIETDGGL LASLDSVEKT AGSEGDTLSY YVRRGNAART 701 ASAAAHSAPA GLKHAVEQGG SNLENLMVEL DASESSATPE TVETAAADRT 751 DMPGIRPYGA TFRAAAAVQH ANAADGVRIF NSLAATVYAD STAAHADMQG 801 RRLKAVSDGL DHNGTGLRVI AQTQQDGGTW EQGGVEGKMR GSTQTVGIAA 851 KTGENTTAAA TLGMGRSTWS ENSANAKTDS ISLFAGIRHD AGDIGYLKGL 901 FSYGRYKNSI SRSTGADEHA EGSVNGTLMQ LGALGGVNVP FAATGDLTVE 951 GGLRYDLLKQ DAFAEKGSAL GWSGNSLTEG TLVGLAGLKL SQPLSDKAVL 1001 FATAGVERDL NGRDYTVTGG FTGATAATGK TGARNMPHTR LVAGLGADVE 1051 FGNGWNGLAR YSYAGSKQYG NHSGRVGVGY RF*

[0060] ΔG983 thus has the following basic sequence: TSAPD FNAGGTGIGS NSRATTAKSA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPPN LHTGDFPNPN DAYKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE LYGRKEHGYN ENYKNYTAYM RKEAPEDGGG KDIEASFDDE AVIETEAKPT DIRHVKEIGH IDLVSHIIGG RSVDGRPAGG IAPDATLHIM NTNDETKNEM MVAAIRNAWV KLGERGVRIV NNSFGTTSRA GTADLFQIAN SEEQYRQALL DYSGGDKTDE GIRLMQQSDY GNLSYHIRNK NMLFIFSTGN DAQAQPNTYA LLPFYEKDAQ KGIITVAGVD RSGEKFKREM YGEPGTEPLE YGSNHCGITA MWCLSAPYEA SVRFTRTNPI QIAGTSFSAP IVTGTAALLL QKYPWMSNDN LRTTLLTTAQ DIGAVGVDSK FGWGLLDAGK AMNGPASFPF GDFTADTKGT SDIAYSFRND ISGTGGLIKK GGSQLQLHGN NTYTGKTIIE GGSLVLYGNN KSDMRVETKG ALIYNGAASG GSLNSDGIVY LADTDQSGAN ETVHIKGSLQ LDGKGTLYTR LGKLLKVDGT AIIGGKLYMS ARGKGAGYLN STGRRVPFLS AAKIGQDYSF FTNIETDGGL LASLDSVEKT AGSEGDTLSY YVRRGNAART ASAAAHSAPA GLKHAVEQGG SNLENLMVEL DASESSATPE TVETAAADRT DMPGIRPYGA TFRAAAAVQH ANAADGVRIF NSLAATVYAD STAAHADMQG RRLKAVSDGL DHNGTGLRVI AQTQQDGGTW EQGGVEGKMR GSTQTVGIAA KTGENTTAAA TLGMGRSTWS ENSANAKTDS ISLFAGIRHD AGDIGYLKGL FSYGRYKNSI SRSTGADEHA EGSVNGTLMQ LGALGGVNVP FAATGDLTVE GGLRYDLLKQ DAFAEKGSAL GWSGNSLTEG TLVGLAGLKL SQPLSDKAVL FATAGVERDL NGRDYTVTGG FTGATAATGK TGARNMPHTR LVAGLGADVE FGNGWNGLAR YSYAGSKQYG NHSGRVGVGY RF*
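The relationship between a full-length protein and its ΔG form can be sketched in code. Judging from the listings for proteins 983 and 741, the ΔG form begins immediately after the N-terminal poly-glycine stretch. The function name `delta_g` and the threshold of four consecutive glycines (`min_run`) are illustrative assumptions, not terms defined in this document.

```python
import re


def delta_g(protein: str, min_run: int = 4) -> str:
    """Return the sequence that follows the first poly-glycine run.

    Assumption: the ΔG form is obtained by deleting everything up to and
    including the first run of at least `min_run` consecutive glycines.
    """
    # Drop position numbers, spacing and the terminal '*' used in the listings.
    seq = re.sub(r"[\s\d*]", "", protein).upper()
    match = re.search(r"G{%d,}" % min_run, seq)
    if match is None:
        raise ValueError("no poly-glycine stretch found")
    return seq[match.end():]


# N-terminal residues of protein 983 as listed above.
n_terminus_983 = "1 MRTTPTFPTK TFKPTAMALA VATTLSACLG GGGGGTSAPD FNAGGTGIGS"
print(delta_g(n_terminus_983))
```

Applied to the first fifty residues of protein 983, the sketch returns a fragment starting with TSAPD, which matches the start of the ΔG983 sequence given in [0060].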

[0061] ΔG983 was expressed as a hybrid, with ORF46.1, 741, 961 or 961c at its C-terminus: ΔG983-ORF46.1 ATGACTTCTGCGCCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCA GCAGTATCTTACGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCG GTTACAGACAGGGATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCA TACAAGAATTTGATCAACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGAC ACAGGCGAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAA AACTATACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAG GCCGTTATAGAGACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCAT ATTATTGGCGGGCGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACG AATGATGAAACCAAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGC ATCGTCAATAACAGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAG TACCGCCAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTAC GGCAACCTGTCCTACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCC AACACATATGCCCTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGT GGAGAAAAGTTCAAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATT ACTGCCATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGA ACATCCTTTTCCGCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAAC CTGCGTACCACGTTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTG GATGCGGGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGAT ATTGCCTACTCCTTCCGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCAC GGCAACAACACCTATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATG CGCGTCGAAACCAAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTAT CTGGCAGATACCGACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACG CTGTACACACGTTTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGC GGCAAGGGGGCAGGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTAT
TCTTTCTTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAA GGCGACACGCTGTCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCC GGTCTGAAACACGCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCC GCAACACCCGAGACGGTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTC CGCGCAGCGGCAGCCGTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTAT GCCGACAGTACCGCCGCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGC ACGGGTCTGCGCGTCATCGCGCAAACCCAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGC GGCAGTACCCAAACCGTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGC AGCACATGGAGCGAAAACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGC GATATCGGCTATCTCAAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAA CATGCGGAAGGCAGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACG GGAGATTTGACGGTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTG GGCTGGAGCGGCAACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGAT AAAGCCGTCCTGTTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACC GGCGCGACTGCAGCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGAT GTCGAATTCGGCAACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGA CGAGTCGGCGTAGGCTACCGGTTCCTCGACGGTGGCGGAGGCACTGGATCCTCAGATTTGGCAAACGATTCTTTTATC CGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCACCTATTCGGCAGCAGGGGGGAACTTGCCGAG CGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGCAACCTGATGATTCAACAGGCGGCCATTAAA GGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTCCATTCCCCCTTCGACAACCATGCCTCACAT TCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTACCGCATCCATTGGGACGGATACGAACACCAT CCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCCAAAGGCGCGAGGGATATATACAGCTACGAC ATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGCAGCACCGGACAACGGCTTGCCGACCGTTTC CACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAACGCGCCACCCGATACAGCCCCGAGCTGGAC AGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTTAAAAACATCATCGGCGCGGCAGGAGAAATT 
GTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATTGCTGTCATGCACGGCTTGGGTCTGCTTTCC ACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAACTCAAAGACTATGCCGCAGCAGCCATCCGC GATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTCAGCAATATCTTTATGGCAGCCATCCCCATC AAAGGGATTGGAGCTGTTCGGGGAAAATACGGCPTGGGCGGCATCACGGCACATCCTATCAAGCGGTCGCAGATGGGC GCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCCGATGCGGCATACGCCAAATACCCGTCCCCT TACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAAGAAAACATCACCTCCTCAACCGTGCCGCCG TCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACAGGCGTACCGTTTGACGGTAAAGGGTTTCCG AATTTTGAGAAGCACGTGAAATATGATACGCTCGAGCACCACCACCACCACCACTGA 1 MTSAPDFNAG GTGIGSNSRA TTAKSAAVSY AGIKNEMCKD RSMLCAGRDD 51 VAVTDRDAKI NAPPPNLHTG DFPNPNDAYK NLINLKPAIE AGYTGRGVEV 101 GIVDTGESVG SISFPELYGR KEHGYNENYK NYTAYMRKEA PEDGGGKDIE 151 ASFDDEAVIE TEAKPTDIRH VKEIGHIDLV SHIIGGRSVD GRPAGGIAPD 201 ATLHIMNTND ETKNEMMVAA IRNAWVKLGE RGVRIVNNSF GTTSRAGTAD 251 LFQIANSEEQ YRQALLDYSG GDKTDEGIRL MQQSDYGNLS YHIRNKNMLF 301 IFSTGNDAQA QPNTYALLPF YEKDAQKGII TVAGVDRSGE KFKREMYGEP 351 GTEPLEYGSN HCGITAMWCL SAPYEASVRF TRTNPIQIAG TSFSAPIVTG 401 TAALLLQKYP WMSNDNLRTT LLTTAQDIGA VGVDSKFGWG LLDAGKAMNG 451 PASFPFGDFT WMTKGTSDIA YSFRNDISGT GGLIKKGGSQ LQLHGNNTYT 501 GKTIIEGGSL VLYGNNKSDM RVETKGALIY NGAASGGSLN SDGIVYLADT 551 DQSGANETVH IKGSLQLDGR GTLYTRLGKL LKVDGTAIIG GKLYMSARGK 601 GAGYLNSTGR RVPFLSAAKI GQDYSFFTNI ETDGGLLASL DSVEKTAGSE 651 GDTLSYYVRR GNAARTASAA AHSAPAGLKH AVEQGGSNLE NLMVELDASE 701 SSATPETVET AAADRTDNPG IRPYGATFRA AAAVQHANAA DGVRIFNSLA 751 ATVYADSTAA HADMQGRBLK AVSDGLDHNG TGLRVIAQTQ QDGGTWEQGG 801 VEGKMRGSTQ TVGIAAKTGE NTTAAATLGM GRSTWSENSA NAKTDSISLF 851 AGIRHDAGDI GYLKGLFSYG RYKNSISRST GADEHAEGSV NGTLMQLGAL 901 GGVNVPFAAT GDLTVEGGLR YDLLKQDAFA EKGSALGWSG NSLTEGTLVG 951 LAGLKLSQPL SDKAVLFATA GVERDLNGRD YTVTGGFTGA TAATGKTGAR 1001 NMPHTRLVAG LGADVEFGNG WNGLARYSYA GSKQYGNHSG RVGVGYRFLD 1051 GGGGTGSSDL ANDSFIRQVL DRQHFEPDGK YHLFGSRGEL AERSGHIGLG 1101 KIQSHQLGNL MIQQAAIKGN IGYIVRFSDH GHBVHSPFDN HASHSDSDEA 1151 GSPVDGFSLY RIHWDGYEHH 
PADGYDGPQG GGYPAPKGAR DIYSYDIKGV 1201 AQNIRLNLTD NRSTGQRLAD RFHNAGSMLT QGVGDGFKRA TRYSPELDRS 1251 GNAAEAFNGT ADIVKNIIGA AGEIVGAGDA VQGISEGSNI AVMHGLGLLS 1301 TENKMARIND LADMAQLKDY AAAAIRDWAV QNPNAAQGIE AVSNIFMAAI 1351 PIKGIGAVRG KYGLGGITAH PIKRSQMGAI ALPKGKSAVS DNFADAAYAK 1401 YPSPYHSRNI RSNLEQRYGK ENITSSTVPP SNGKNVKLAD QRHPKTGVPF 1451 DGKGFPNFBK HVKYDTLEHH HHHH* ΔG983-741 ATGACTTCTGCGCCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCA GCAGTATCTTACGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCG GTTACAGACAGGGATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCA TACAAGAATTTGATCAACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGAC ACAGGCGAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAA AACTATACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAG GCCGTTATAGAGACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCAT ATTATTGGCGGGCGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACG AATGATGAAACCAAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGC ATCGTCAATAACAGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAG TACCGCCAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTAC GGCAACCTGTCCTACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCC AACACATATGCCCTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGT GGAGAAAAGTTCAAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATT ACTGCCATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGA ACATCCTTTTCCGCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAAC CTGCGTACCACGTTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTG GATGCGGGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGAT ATTGCCTACTCCTTCCGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCAC GGCAACAACACCTATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATG 
CGCGTCGAAACCAAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTAT CTGGCAGATACCGACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACG CTGTACACACGTTTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGC GGCAAGGGGGCAGGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTAT TCTTTCTTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAA GGCGACACGCTGTCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCC GGTCTGAAACACGCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCC GCAACACCCGAGACGGTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTC CGCGCAGCGGCAGCCGTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTAT GCCGACAGTACCGCCGCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGC ACGGGTCTGCGCGTCATCGCGCAAACCCAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGC GGCAGTACCCAAACCGTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGC AGCACATGGAGCGAAAACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGC GATATCGGCTATCTCAAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAA CATGCGGAAGGCAGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACG GGAGATTTGACGGTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTG GGCTGGAGCGGCAACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGAT AAAGCCGTCCTGTTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACC GGCGCGACTGCAGCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGAT GTCGAATTCGGCAACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGA CGAGTCGGCGTAGGCTACCGGTTCCTCGAGGGATCCGGAGGGGGTGGTGTCGCCGCCGACATCGGTGCGGGGCTTGCC GATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAGTCTTTGACGCTGGATCAGTCCGTCAGGAAAAAC GAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGAAACGGTGACAGCCTCAATACGGGCAAATTGAAG AACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTGGACGGGCAGCTCATTACCTTGGAGAGTGGAGAG TTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAGACCGAGCAAATACAAGATTCGGAGCATTCCGGG 
AAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGCGAACATACATCTTTTGACAAGCTTCCCGAAGGC GGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCCGGCGGAAAACTGACCTACACCATAGATTTCGCC GCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAACTCAATGTCGACCTGGCCGCCGCCGATATCAAG CCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTACAACCAAGCCGAGAAAGGCAGTTACTCCCTCGGT ATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTGAAAACCGTAAACGGCATACGCCATATCGGCCTT GCCGCCAAGCAACTCGAGCACCACCACCACCACCACTGA 1 MTSAPDFNAG GTGIGSNSRA TTAKSAAVSY AGIKNEMCKD RSMLCAGRDD 51 VAVTDRDAKI NAPPPNLHTG DFPNPNDAYK NLINLKPAIE AGYTGRGVEV 101 GIVDTGESVG SISFPELYGR KEHGYNENYK NYTAYMRKEA PEDGGGKDIE 151 ASFDDEAVIE TEAKPTDIRH VKBIGHIDLV SHIIGGRSVD GRPAGGIAPD 201 ATLHIMNTND ETKNEMMVAA IRNAWVKLGE RGVRIVNNSF GTTSRAGTAD 251 LFQIANSEEQ YRQALLDYSG GDKTDEGIRL MQQSDYGNLS YHIRNKNMLF 301 IFSTGNDAQA QPNTYALLPF YEKDAQKGII TVAGVDRSGE KFKRBMYGEP 351 GTEPLEYGSN HCGITAMWCL SAPYEASVRF TRTNPIQIAG TSFSAPIVTG 401 TAALLLQKYP WMSNDNLRTT LLTTAQDIGA VGVDSKFGWG LLDAGKAMNG 451 PASFPFGDFT ADTKGTSDIA YSFRNDISGT GGLIKKGGSQ LQLHGNNTYT 501 GKTIIEGGSL VLYGNNKSDM RVETKGALIY NGAASGGSLN SDGIVYLADT 551 DQSGANETVH IKGSLQLDGK GTLYTRLGKL LKVDGTAIIG GKLYMSARGK 601 GAGYLNSTGR RVPFLSAAKI GQDYSFFTNI ETDGGLLASL DSVEKTAGSE 651 GDTLSYYVRR GNAARTASAA AHSAPAGLKH AVBQGGSNLE NLMVELDASE 701 SSATPETVET AAADRTDHPG IRPYGATFRA AAAVQHANAA DGVRIFNSLA 751 ATVYADSTAA HADMQGRBLK AVSDGLDHNG TGLRVIAQTQ QDGGTWEQGG 801 VEGKMRGSTQ TVGIAAKTGE NTTAAATLGM GRSTWSENSA NAKTDSISLF 851 AGIPHDAGDI GYLKGLFSYG RYKNSISRST GADEHAEGSV NGTLNQLGAL 901 GGVNVPFAAT GDLTVEGGLR YDLLKQDAFA EKGSALGWSG NSLTEGTLVG 951 LAGLKLSQPL SDKAVLFATA GVERDLNGRD YTVTGGFTGA TAATGKTGAR 1001 NMPHTRLVAG LGADVEFGNG WNGLARYSYA GSKQYGNHSG RVGVGYRFLE 1051 GSGGGGVAAD IGAGLADALT APLDHKDKGL QSLTLDQSVR KNEKLKLAAQ 1101 GAEKTYGNGD SLNTGKLKND KVSRFDFIRQ IEVDGQLITL ESGEFQVYKQ 1151 SHSALTAPQT EQIQDSEHSG KMVAKRQFRI GDIAGEHTSF DKLPEGGRAT 1201 YRGTAFGSDD AGGKLTYTID FAAKQGNGKI EHLKSPELNV DLAAADIKPD 1251 GK~HAVISGS VLYNQAEKGS YSLGIFGGKA QEVAGSAEVK TVNGIRHIGL 1301 AAKQLEHHHH HH* ΔG983 
-961 ATGACTTCTGCGCCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCA GCAGTATCTTACGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCG GTTACAGACAGGGATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCA TACAAGAATTTGATCAACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGAC ACAGGCGAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAA AACTATACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAG GCCGTTATAGAGACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCAT ATTATTGGCGGGCGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACG AATGATGAAACCAAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGC ATCGTCAATAACAGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAG TACCGCCAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTAC GGCAACCTGTCCTACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCC AACACATATGCCCTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGT GGAGAAAAGTTCAAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATT ACTGCCATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGA ACATCCTTTTCCGCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAAC CTGCGTACCACGTTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTG GATGCGGGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGAT ATTGCCTACTCCTTCCGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCAC GGCAACAACACCTATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATG CGCGTCGAAACCAAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTAT CTGGCAGATACCGACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACG CTGTACACACGTTTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGC GGCAAGGGGGCAGGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTAT TCTTTCTTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAA 
GGCGACACGCTGTCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCC GGTCTGAAACACGCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCC CATGCGGAAGGCAGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACG GGAGATTTGACGGTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTG GGCTGGAGCGGCAACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGAT AAAGCCGTCCTGTTTGCAACGGGGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACC GGCGCGACTGCAGCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGAT GTCGAATTCGGCAACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGA CGAGTCGGCGTAGGCTACCGGTTCCTCGAGGGTGGCGGAGGCACTGGATCCGCCACAAACGACGACGATGTTAAAAAA GCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTCAAAGCTGGAGAGACCATCTAC GACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAAGCCGACGACTTTAAAGGTCTG GGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAACGTCGATGCCAAAGTAAAAGCT GCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTAGCAGATACTGATGCCGCTCTG GATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAAGAGACTAAGACAAATATCGTA AAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCATTCAACGATATCGCCGATTCA TTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAACAGACGGCCGAAGAAACCAAA CAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCCGCTGGCACAGCTAATACT GCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATCGCTACGAACAAAGATAATATT GCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTTGTCAGAATTGATGGTCTGAAC GCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGATCACGATACTCGCCTGAACGGT TTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAAGCCGCGCTCTCCGGTCTGTTC CAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGCTACAAATCCGAATCGGCAGTCGCCATCGGT ACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCAGTCGGCACTTCGTCCGGTTCTTCCGCAGCC TACCATGTCGGCGTCAATTACGAGTGGCTCGAGCACCACCACCACCACCACTGA 1 MTSAPDFNAG GTGIGSNSRA TTAKSAAVSY AGIKNEMCKD RSMLCAGRDD 51 VAVTDRDAKI NAPPPNLHTG DFPNPNDAYK NLINLKPAIE AGYTGRGVBV 101 
GIVDTGESVG SISFPELYGR KEHGYNENYK NYTAYMBKEA PEDGGGKDIE 151 ASFDDEAVIE TEAKPTDIRH VKEIGHIDDV SHIIGGRSVD GRPAGGIAPD 201 ATLHIMNTND ETKNBNMVAA IRNAWVKLGE RGVRIVNNSF GTTSRSGTAD 251 LFQIANSEEQ YRQALLDYSG GDKTDEGIHL MQQSDYGNLS YHIRNKIMLP 301 IFSTGNDAQA QPNTYALLPF YEKDAQKGII TVAGVDRSGE KFKREMYGEP 351 GTEPLEYGSN HCGITAMWCL SAPYEASVRF TRTNPIQIAG TSFSAPIVTG 401 TAALLLQKYP WHSNDNLRTT LLTTAQDIGA VGVDSKFGWG LLDAGKAMNG 451 PASFPFGDFT ADTKGTSDIA YSFRNDISGT GGLIKKGGSQ LQLHGNNTYT 501 GKTIIEGGSL VLYGNNKSDH RVETKGALIY NGAASGGSLN SDGIVYLADT 551 DQSGANETVH IKGSLQLDGK GTLYTRLGKL LKVDGTAIIG GKLYMSARGK 601 GAGYLNSTGR RVPFLSAAKI GQDYSFFTNI ETDGGLLASL DSVEKTAGSE 651 GDTLSYYVRR GNAARTASAA AHSAPAGLKE AVBQGGSNLE NLMVELDASE 701 SSATPETVET AAADRTDHPG IRPYGATFRA AAAVQHANAA DGVRIFNSLA 751 ATVYADSTAA HADMQGRRLK AVSDGLDHNG TGLRVIAQTQ QDGGTWBQGG 801 VEGKMRGSTQ TVGIAAKTGE NTTAAATLGM GRSTWSENSA NAKTDSISLF 851 AGIRHDAGDI GYLKGLFSYG RYKNSISRST GADEHAEGSV NGTLMQLGAL 901 GGVNVPFAAT GDLTVEGGLR YDLLKQDAFA EKGSALGWSG NSLTEGTLVG 951 LAGLKLSQPL SDKAVLFATA GVERDLNGRD YTVTGGFTGA TAkTGKTGAR 1001 NMPHTRLVAG LGADVEFGNG WNGLARYSYA GSKQYGNHSG RVGVGYRFLE 1051 GGGGTGSATN DDDVKKAATV AIAAAYNNGQ EINGFKAGET IYDIDEDGTI 1101 TKKDATAADV EADDFKGLGL KKVVTNLTKT VNENKQNVDA KVKAAESEIE 1151 KLTTKLADTD AALADTDAAL DATTNALNKL GENITTFABE TKTNIVKIDE 1201 KLEAVADTVD KHAEAFNDIA DSLDETNTKA DEAVKTANEA KQTAEETKQN 1251 VDAKVKAAET AAGKAEAAAG TANTAADKAE AVAMCVTDIK ADIATNKDNI 1301 AKKANSADVY TREBSDSKFV RIDGLNATTE KLDTRLASAE KSIADHDTEL 1351 NGLDKTVSDL RKETRQGLAE QAALSGLPQP YNVGRFNVTA AVGGYKSESA 1401 VAIGTGFRPT ENFAAKAGVA VGTSSGSSAA YHVGVNYEWL EHHHHHH* AG983 -961c ATGACTTCTGCGCCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCA GCAGTATCTTACGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCG GTTACAGACAGGGATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAkTGACGCA TACIAGAATTTGATCAACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGAC 
ACAGGCGAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAA AACTATACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAG GCCGTTATAGAGACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCAT ATTATTGGCGGGCGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACG AATGATGAAACCAAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGC ATCGTCAATAACAGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAG TACCGCCAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTAC GGCAACCTGTCCTACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCC AACACATATGCCCTATTGCCATTTTATGAAAAAGACGGTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGT GGAGAAAAGTTCAAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATT ACTGCCATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGA ACATCCTTTTCCGCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAAC CTGCGTACCACGTTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTG GATGCGGGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGAT ATTGCCTACTCCTTCCGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCAC GGCAACAACACCTATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATG CGCGTCGAAACCAAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTAT CTGGCAGATACCGACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACG CTGTACACACGTTTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGC GGCAAGGGGGCAGGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTAT TCTTTCTTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAA GGCGACACGCTGTCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCC GGTCTGAAACACGCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCC GCAACACCCGAGACGGTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTC CGCGCAGCGGCAGCCGTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTAT 
GCCGACAGTACCGCCGCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGC ACGGGTCTGCGCGTCATCGCGCAAACCCAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGC GGCAGTACCCAAACCGTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGC AGCACATGGAGCGAAAACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGC GATATCGGCTATCTCAAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAA CATGCGGAAGGCAGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACG GGAGATTTGACGGTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTG GGCTGGAGCGGCAACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGAT AAAGCCGTCCTGTTTGCAACGGCGGGCGTGGAACGCGACCTQAACGGACGCGACTACACGGTAACGGGCGGCTTTACC GGCGCGACTGCAGCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGAT GTCGAATTCGGCAACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGA CGAGTCGGCGTAGGCTACCGGTTCCTCGAGGGTGGCGGAGGCACTGGATCCGCCACAAACGACGkCGATGTTAAAAAA GCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTCAAAGCTGGAGAGACCATCTAC GACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAAGCCGACGACTTTAAAGGTCTG GGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAACGTCGATGCCAAAGTAAAAGCT GCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTAGCAGATACTGATGCCGCTCTG GATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAAGAGACTAAGACAAATATCGTA AAAkTTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCATTCAACGATATCGCCGATTCA TTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAACAGACGGCCGAAGAAACCAAA CAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCCGCTGGCACAGCTAATACT GCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATCGCTACGAACAAAGATAATATT GCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTTGTCAGAATTGATGGTCTGAAC GCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGATCACGATACTCGCCTGAACGGT TTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAAGCCGCGCTCTCCGGTCTGTTC CAACCTTACAACGTGGGTCTCGAGCACCACCACCACCACCACTGA 1 MTSAPDFNAG GTGIGSNSRA TTAKSAAVSY AGIKNEMCKD RSMLCAGRDD 
51 VAVTDRDAKI NAPPPNLHTG DFPNPNDAYK NLINLKPAIB AGYTGRGVEV 101 GIVDTGESVG SISFPELYGR KEEGYNENYK NYTAYMRKEA PEDGGGKDIE 151 ASFDDEAVIE TEAKPTDIBH VKEIGHIDLV SHIIGGRSVD GRPAGGIAPD 201 ATLHINNTND ETKNEMMVAA IRNAWVKLGE RGVRIVNNSF GTTSRAGTAD 251 LFQIANSEEQ YRQALLDYSG GDKTDEGIRL MQQSDYGNLS YHIRNKNMLF 301 IFSTGNDAQA QPNTYALLPF YEKDAQKGII TVAGVDRSGE KFKREMYGEP 351 GTEPLEYGSN HCGITAMWCL SAPYEASVRF TRTNPIQIAG TSFSAPIVTG 401 TAALLLQKYP WMSNDNLRTT LLTTAQDIGA VGVDSKFGWG LLDAGKAMNG 451 PASFPFGDFT ADTKGTSDIA YSFRNDISGT GGLIKKGGSQ LQLHGNNTYT 501 GKTIIEGGSL VLYGNNKSDM RVETKGALIY NGAASGGSLN SDGIVYLADT 551 DQSGANETVH IKGSLQLDGK GTLYTRLGKL LKVDGTAIIG GKLYHSARGK 601 GAGYLNSTGR RVPFLSAAKI GQDYSFFTNI ETPGGLLASL DSVEKTAGSE 651 GDTLSYYVRR GNAARTASAA AHSAPAGLKH AVEQGGSNLE NLMVELDASE 701 SSATPETVET AAADRTDMPG IRPYGATFRA AAAVQHANAP DGVRIFNSLA 751 ATVYADSTAA HADMQGRRLK AVSDGLDHNG TGLRVIAQTQ QDGGTWEQGG 801 VEGKMRGSTQ TVGIAAKTGE NTTAAATLGN GRSTWSENSA NAKTDSISLF 851 AGIEHDAGDI GYLKGLFSYG RYKNSISRST GADEHAEGSV NGTLNQLGAL 901 GGVNVPFAkT GDLTVEGGLR YDLLKQDAFA EKGSALGWSG NSLTEGTLVG 951 LAGLKLSQPL SDKAVLPATA GVERDLNGRD YTVTGGFTGA TAATGKTGAP 1001 NMPHTRLVAG LGADVEFGNG WNGLARYSYA GSKQYGNHSG RVGVGYRFLE 1051 GGGGTGSATN DDDVKKAATV AIAAAYNNGQ EINGFKAGET IYDIDEDGTI 1101 TKKDATAADV EADDFKGLGL KKVVTNLTKT VNENKQNVDA KVKAAESEIE 1151 KLTTKLADTD AALADTDAAL DATTNALNKL GENITTFAEE TKTNIVKIDE 1201 KLEAVADTVD KHAEAFNDIA DSLDETNTKA DEAVKTANEA KQTAEETKQN 1251 VDAKVKAAET AAGKABAAAG TANTAADKAE AVAAKVTDIK ADIATNKDNI 1301 AKKANSADVY TREESDSKFV RIDGLNATTE KLDTRLASAE KSIADHDTRL 1351 NGLDKTVSDL RKETRQGLAE QAALSGLFQP YNVGLEHHHH HH*
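The junctions in the hybrid listings above can be checked by translating short stretches of the nucleotide sequences. The following is a minimal sketch that uses the standard genetic code; the helper name `translate` and the table layout are illustrative assumptions, and the three test fragments are copied from the ΔG983-961 listing (start of the coding sequence, the segment between the two proteins, and the C-terminal end).

```python
import re

# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = re.sub(r"\s+", "", dna).upper()
    if re.search(r"[^ACGT]", seq):
        raise ValueError("sequence contains characters other than A, C, G, T")
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


start = translate("ATGACTTCTGCGCCCGACTTCAATGCAGGC")   # start of ΔG983
linker = translate("CTCGAGGGTGGCGGAGGCACTGGATCC")     # between ΔG983 and 961
tag = translate("CTCGAGCACCACCACCACCACCACTGA")        # C-terminal end
print(start, linker, tag)
```

The three fragments translate to MTSAPDFNAG, LEGGGGTGS and LEHHHHHH followed by a stop, in agreement with the corresponding residues in the ΔG983-961 protein listing. Rejecting characters other than A, C, G and T is a deliberate choice, so that a mistyped base is reported rather than silently mistranslated.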

EXAMPLE 4 Hybrids of ΔG741

[0062] Protein 741 has the following sequence: 1 VNRTAFCCLS LTTALILTAC SSGGGGVAAD IGAGLADALT APLDHKDKGL 51 QSLTLDQSVR KNEKLKLAAQ GAEKTYGNGD SLNTGKLKND KVSRFDFIRQ 101 IEVDGQLITL ESGEFQVYKQ SHSALTAFQT EQIQDSEHSG KMVAKRQFRI 151 GDIAGEHTSF DKLPEGGRAT YRGTAFGSDD AGGKLTYTID FAAKQGNGKI 201 EHLKSPELNV DLAAADIKPD GKRHAVISGS VLYNQAEKGS YSLGIFGGKA 251 QEVAGSAEVK TVNGIRHIGL AAKQ*

[0063] ΔG741 thus has the following basic sequence: VAAD IGAGLADALT APLDHKDKGL QSLTLDQSVR KNEKLKLAAQ GAEKTYGNGD SLNTGKLKND KVSRFDFIRQ IEVDGQLITL ESGEFQVYKQ SHSALTAFQT EQIQDSEHSG KMVAKRQFRI GDIAGEHTSF DKLPEGGRAT YRGTAFGSDD AGGKLTYTID FAAKQGNGKI EHLKSPELNV DLAAADIKPD GKRHAVISGS VLYNQAEKGS YSLGIFGGKA QEVAGSAEVK TVNGIRHIGL AAKQ*

[0064] ΔG741 was fused directly in-frame upstream of proteins 961, 961c, 983 and ORF46.1: ΔG741-961 ATGGTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAG TCTTTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGA AACGGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGGCGTTTCGACTTTATCCGCCAAATCGAAGTG GACGGGCAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAG ACCGAGCAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGC GAACATACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCC GGCGGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAA CTCAATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTAC AACCAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTG AAAACCGTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGGGTGGCGGAGGCACTGGATCCGCCACA AACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTC AAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAA GCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAAC GTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTA GCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAA GAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCA TTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAA CAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCT GCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATC GCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTT GTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGAT CACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAA GCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGCTACAAATCC GAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCAGTCGGCACT 
TCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGCTCGAGCACCACCACCACCACCACTGA 1 MVAADIGAGL ADALTAPLDH KDKGLQSLTL DQSVRKNEKL KLAAQGAEKT 51 YGNGDSLNTG KLKNDKVSRF DFIRQIEVDG QLITLESGEF QVYKQSHSAL 101 TAFQTEQIQD SEHSGKMVAK RQFRIGDIAG EHTSFDKLPE GGRATYRGTA 151 FGSDDAGGKL TYTIDFAAKQ GNGKIEHLKS PELNVDLAAA DIKPDGKRHA 201 VISGSVLYNQ AEKGSYSLGI FGGKAQEVAG SAEVKTVNGI RHIGLAAKQL 251 EGGGGTGSAT NDDDVKKAAT VAIAAAYNNG QEINGFKAGE TIYDIDEDGT 301 ITKKDATAAD VEADDFKGLG LKKVVTNLTK TVNENKQNVD AKVKAAESEI 351 EKLTTKLADT DAALADTDAA LDATTNALNK LGENITTFAE ETKTNIVKID 401 EKLEAVADTV DKHAEAFNDI ADSLDETNTK ADEAVKTANE AKQTAEETKQ 451 NVDAKVKAAE TAAGKAEAAA GTANTAADKA EAVAAKVTDI KADIATNKDN 501 IAKKANSADV YTREESDSKF VRIDGLNATT EKLDTRLASA EKSIADHDTR 551 LNGLDKTVSD LRKETRQGLA EQAALSGLFQ PYNVGRFNVT AAVGGYKSES 601 AVAIGTGFRF TENFAAKAGV AVGTSSGSSA AYHVGVNYEW LEHHHHHH* ΔG741-961c ATGGTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAG TCTTTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGA AACGGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTG GACGGGCAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAG ACCGAGCAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGC GAACATACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCC GGCGGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAA CTCAATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTAC AACCAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTG AAAACCGTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGGGTGGCGGAGGCACTGGATCCGCCACA AACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTC AAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAA GCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAAC GTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTA
GCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAA GAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCA TTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAA CAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCT GCCGCTGGCACAGCTAATACTGCAGCCGAGAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATC GCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTT GTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGAT CACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAA GCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCTCGAGCACCACCACCACCACCACTGA 1 MVAADIGAGL ADALTAPLDH KDKGLGSLTL DQSVRKNEKL KLAAGGAEKT 51 YGNGDSLNTG KLKNDKVSRF DFIRGIEVDG GLITLESGEF GVYKGSHSAL 101 TAFGTEGIGD SEHSGKMVAK RGFRIGDIAG EETSFDKLPE GGRATYRGTA 151 FGSDDAGGKL TYTIDFAAKQ GNGKIEHLKS PELNVDLAAA DIKPDGKRHA 201 VISGSVLYNG AEKGSYSLGI FGGKAGEVAG SAEVKTVNGI RHIGLAAKGL 251 EGGGGTGSAT NDDDVKKAAT VAIAAAYNNG GEINGFKAGE TIYDIDEDGT 301 ITKKDATAAD VEADDFKGLG LKKVVTNLTK TVNENKGNVD AKVKAAESEI 351 EKLTTKLADT DAALADTDAA LDATTNALNK LGENITTFAE ETKTNIVKID 401 EKLEAVADTV DKHAEAFNDI ADSLDETNTK ADEAVKTANE AKGTAEETKG 451 NVDAKVKAAE TAAGKAEAAA GTANTAADKA EAVAAKVTDI KADIATNKDN 501 IAKKANSADV YTREESDSKF VRIDGLNATT EKLDTRLASA EKSIADHDTR 551 LNGLDKTVSD LRKETRGGLA EGAALSGLFG PYNVGLEHHH HHH* ΔG741-983 ATGGTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGGAG TCTTTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGA AACGGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTG GACGGGCAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAG ACCGAGCAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGC GAACATACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCC GGCGGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAA 
CTCAATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTAC AACCAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTG AAAACCGTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGGGATCCGGCGGAGGCGGCACTTCTGCG CCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCAGCAGTATCTTAC GCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCGGTTACAGACAGG GATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCATACAAGAATTTG ATCAACCTCAAACCTGCAATTGAkGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGACACAGGCGAATCC GTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAAAACTATACGGCG TATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAGGCCGTTATAGAG ACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCATATTATTGGCGGG CGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACGAATGATGAAACC AAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGCATCGTCAATAAC AGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAGTACCGCCAAGCG TTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTACGGCAACCTGTCC TACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCCAACACATATGCC CTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGTGGAGAAAAGTTC AAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATTACTGCCATGTGG TGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGAACATCCTTTTCC GCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAACCTGCGTACCACG TTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTGGATGCGGGTAAG GCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGATATTGCCTACTCC TTCCGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCACGGCAACAACACC TATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATGCGCGTCGAAACC AAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTATCTGGCAGATACC GACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACGCTGTACACACGT 
TTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGCGGCAAGGGGGCA GGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTATTCTTTCTTCACA AACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAAGGCGACACGCTG TCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCCGGTCTGAAACAC GCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCCGCAACACCCGAG ACGGTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTCCGCGCAGCGGCA GCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCCGCAACACCCGAG GCCGCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGCACGGGTCTGCGC GTCATCGCGCAAACCCAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGCGGCAGTACCCAA ACCGTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGCAGCACATGGAGC GAAAACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGCGATATCGGCTAT CTCAAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAACATGCGGAAGGC AGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACGGGAGATTTGACG GTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTGGGCTGGAGCGGC AACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGATAAAGCCGTCCTG TTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACCGGCGCGACTGCA GCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGkTGTCGAATTCGGC AACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGACGAGTCGGCGTA GGCTACCGGTTCCTCGAGCACCACCACCACCACCACTGA 1 MVAADIGAGL ADALTAPLDH KDKGLGSLTL DGSVRKNEKL KLAAGGAEKT 51 YGNGDSLNTG KLKNDKVSRF DFIRGIEVDG GLITLESGEF GVYKGSHSAL 101 TAFGTEGIGD SBHSGKNVAK RGFRIGDIAG EHTSFDKLPE GGRATYRGTA 151 FGSDDAGGKL TYTIDFAAKG GNGKIEHLKS PELNVDLAAA DIKPDGKRHA 201 VISGSVLYNG AEKGSYSLGI FGGKAGEVAG SABVKTVNGI RHIGLAAKGL 251 EGSGGGGTSA PDFNAGGTGI GSNSRATTAR SAAVSYAGIK NEHCKDRSML 301 CAGRDDVAVT DRDAKINAPP PNLHTGDFPN PNDAYKNLIN LKPAIEAGYT 351 GRGVEVGIVD TGESVGSVGF PELYGRKEHG YNENYKNYTA YMRKEAPEDG 401 GGKDIEASFD DEAVIETEAK PTDIRHVKEI GHIDLVSHII GGRSVDGRPA 451 
GGIAPDATLH IMNTNDETKN EMMVAAIRNA WVKLGERGVR IVNNSFGTTS 501 RAGTADLFGI LNSEEGYRGA LLDYSGGDKT DEGIRLMQQS DYGNLSYHIR 551 NKNMLFIFST GNDAGAGPNT YALLPFYEKD AGKGIITVAG VDRSGEKFKR 601 EMYGEPGTEP LEYGSNHCGI TAMWCLSAPY EASVRFTRTN PIGIAGTSFS 651 APIVTGTAAL LLGKYPWMSN DNLRTTLLTT AGDIGAVGVD SKFGWGLLDA 701 GKAHNGPASF PFGDFTADTK GTSDIAYSFR NDISGTGGLI KKGGSGLGLH 751 GNNTYTGKTI IEGGSLVLYG NNKSDMRVET KGALIYNGAA SGGSLNSDGI 801 VYLADTDGSG ANETVHIKGS LGLDGKGTLY TELGKLLKVD GTAIIGGKLY 851 MSARGKGAGY LNSTGRRVPF LSAAKIGGDY SFFTNIETDG GLLASLDSVE 901 KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEG GGSNLENLMV 951 ELDASESSAT PETVETAAAD RTDMPGIRPY GATFRAAAAV GHANAADGVR 1001 IFNSLAATVY ADSTAAHADN GGRPLKAVSD GLDHNGTGLR VIAGTGGDGG 1051 TWEGGGVBGK MRGSTGTVGI AAKTGENTTA AATLGMGRST WSENSANAKT 1101 DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL 1151 MGLGALGGVN VPFAATGDLT VEGGLRYDLL KGDAFAEKGS ALGWSGNSLT 1201 EGTLVGLAGL KLSGPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT 1251 GKTGARNMPH TRLVAGLGAD VEFGNGWNGL ARYSYAGSKG YGNHSGRVGV 1301 GYRFLEHHHH HH* ΔG741-ORF46.1 ATGGTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAG TCTTTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGA AACGGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTG GACGGGCAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAG ACCGAGCAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGC GAACATACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCC GGCGGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAA CTCAATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTAC AACCAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTG AAAACCGTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGACGGTGGCGGAGGCACTGGATCCTCAGAT TTGGCAAACGATTCTTTTATCCGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATAGCACCTATTCGGC AGCAGGGGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGCAACCTGATG 
ATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCGGCTTTTCCGATCACGGGCACGAAGTCCATTCCCCC TTCGACAACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTACCGCATCCAT TGGGACGGATACGAACACCATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCCAAAGGCGCG AGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGCAGCACCGGA CAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAACGCGCCACC CGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTTAAAAACATC ATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTGAAACATTGCTGTGATG CACGGCTTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAACTCAAAGAC TATGCCGCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTCAGCAATATC TTTATGGCAGCCATCCCCATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACGGGACATCCT ATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGGGACAATTTTGCCGATGCGGCA TACGCCAAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAAGAAAACATC ACCTCCTCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACAGGCGTACCG TTTGACGGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAATATGATACGCTCGAGCACCACCACCACCACCACTGA 1 MVAADIGAGL ADALTAPLDH KDKGLGSLTL DGSVRKNEKL KLAAGGAEKT 51 YGNGDSLNTG KLKNDKVSRF DFIRGIEVDG GLITLESGEF GVYKGSHSAL 101 DAFGTEGIGD SEHSGKMVAK RGFRIGDIAG EHTSFDKLPE GGRATYRGTA 151 FGSDDAGGKL TYTIDFAAKG GNGRIEHLKS PELNVDLAAA DIKPDGKRHA 201 VISGSVLYNG AEKGSYSLGI FGGKAGEVAG SAEVKTVNGI RHIGLAAKGL 251 DGGGGTGSSD LANDSFIRGV LDRGHFBPDG KYHLFGSRGE LAERSGHIGL 301 GKIGSHGLGN LMIGGAAIKG NIGYIVRFSD HGHEVHSPFD NHASHSDSDE 351 AGSPVDGPSL YRIHWDGYEH HPADGYDGPG GGGYPAPKGA RPIYSYDIKG 401 VAGNIRLNLT DNRSTGQRLA DRFHNAGSML TGGVGDGFKR ATRYSPELDR 451 SGNAAEAFNG TADIVKNIIG AAGEIVGAGD AVQGISEGSN IAVHHGLGLL 501 STENKMARIN DLADMAGLKD YAAAAIRDWA VGNPNAAGGI EAVSNIFNAA 551 IPIKGIGAVR GKYGLGGITA HPIKRSGMGA IALPKGKSAV SDNFADAAYA 601 KYPSPYHSRN IRSNLEGRYG KENITSSTVP PSNGKNVKLA DGRHPKTGVP 651 FDGKGFPNFE KHVKYDTLRH HHHHH*
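
The in-frame fusions listed above can be spot-checked by translating short stretches of the nucleotide sequence and comparing the result with the printed protein sequence. The following is a minimal sketch, assuming the standard genetic code; the helper name `translate` and the three excerpts chosen are illustrative only and are not part of the original disclosure.

```python
from itertools import product

# Standard genetic code, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; unknown codons become 'X'."""
    seq = "".join(dna.split()).upper()  # drop the spaces used in the listing
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# Excerpts copied from the deltaG741-961 listing above.
N_TERMINUS = translate("ATGGTCGCCGCCGACATCGGT")        # start of deltaG741
LINKER = translate("CTCGAGGGTGGCGGAGGCACTGGATCC")      # XhoI site, Gly linker, BamHI site
HIS_TAIL = translate("CTCGAGCACCACCACCACCACCACTGA")    # XhoI site, His-tag, stop

print(N_TERMINUS, LINKER, HIS_TAIL)
```

A mismatch between such a translation and the printed protein sequence points to a transcription error in one of the two listings rather than to a property of the construct.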

EXAMPLE 5 Hybrids of 287

[0065] Expression of 287 as full-length with a C-terminal His-tag, or without its leader peptide but with a C-terminal His-tag, gives fairly low expression levels. Better expression is achieved using an N-terminal GST-fusion. As an alternative to using GST as an N-terminal fusion partner, 287 was placed at the C-terminus of protein 919 ('919-287'), of protein 953 ('953-287'), and of protein ORF46.1 ('ORF46.1-287'). In each case, the leader peptides were deleted, and the hybrids were direct in-frame fusions.

[0066] To generate the 953-287 hybrid, the leader peptides of the two proteins were omitted by designing the forward primer downstream from the leader of each sequence; the stop codon sequence was omitted in the 953 reverse primer but included in the 287 reverse primer. For the 953 gene, the 5′ and the 3′ primers used for amplification included an NdeI and a BamHI restriction site, respectively, whereas for the amplification of the 287 gene the 5′ and the 3′ primers included a BamHI and a XhoI restriction site, respectively. In this way, a sequential directional cloning of the two genes in pET21b+ could be achieved, using NdeI-BamHI (to clone the first gene) and subsequently BamHI-XhoI (to clone the second gene).

[0067] The 919-287 hybrid was obtained by cloning the sequence coding for the mature portion of 287 into the XhoI site at the 3′-end of the 919-His clone in pET21b+. The primers used for amplification of the 287 gene were designed to introduce a SalI restriction site at the 5′-end and a XhoI site at the 3′-end of the PCR fragment. Since the cohesive ends produced by the SalI and XhoI restriction enzymes are compatible, the 287 PCR product digested with SalI-XhoI could be inserted in the pET21b-919 clone cleaved with XhoI.
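
The compatibility of SalI and XhoI ends relied on above can be illustrated with a short sketch. It assumes the commonly documented recognition sites and top-strand cut positions (SalI G^TCGAC, XhoI C^TCGAG, BamHI G^GATCC, NdeI CA^TATG); the function names are hypothetical and serve only this illustration.

```python
# Recognition site and top-strand cut offset for each enzyme (assumed values,
# taken from standard enzyme references, not from this document).
ENZYMES = {
    "SalI": ("GTCGAC", 1),
    "XhoI": ("CTCGAG", 1),
    "BamHI": ("GGATCC", 1),
    "NdeI": ("CATATG", 2),
}


def overhang(enzyme: str) -> str:
    """Return the 5' overhang left by a palindromic site cut symmetrically."""
    site, cut = ENZYMES[enzyme]
    return site[cut:len(site) - cut]


def compatible(a: str, b: str) -> bool:
    """Two palindromic 5' overhangs anneal if they are identical."""
    return overhang(a) == overhang(b)


def junction(left: str, right: str) -> str:
    """Sequence formed when a `left`-cut end is ligated to a `right`-cut end."""
    left_site, left_cut = ENZYMES[left]
    right_site, right_cut = ENZYMES[right]
    return left_site[:len(left_site) - left_cut] + right_site[len(right_site) - right_cut:]


SITES = {site for site, _ in ENZYMES.values()}
HYBRID = junction("XhoI", "SalI")  # vector cut with XhoI, insert 5' end cut with SalI

print(overhang("SalI"), overhang("XhoI"), compatible("SalI", "XhoI"), HYBRID)
```

Under these assumptions both enzymes leave a TCGA overhang, and the XhoI/SalI junction is a hybrid sequence that neither enzyme recognises, whereas an XhoI/XhoI junction regenerates the XhoI site.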

[0068] The ORF46.1-287 hybrid was obtained similarly.

[0069] The bactericidal efficacy (homologous strain) of antibodies raised against the hybrid proteins was compared with antibodies raised against simple mixtures of the component antigens:

            Mixture with 287    Hybrid with 287
919         32000               16000
953         8192                8192
ORF46.1     128                 8192
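
To make the comparison above easier to read, the hybrid and mixture titres can be expressed as a ratio. The sketch below uses only the three pairs of values given in paragraph [0069]; the names `TITRES` and `fold_change` are illustrative, and a ratio of titres is just one convenient summary, not a method stated in the source.

```python
# (mixture with 287, hybrid with 287), copied from paragraph [0069].
TITRES = {
    "919": (32000, 16000),
    "953": (8192, 8192),
    "ORF46.1": (128, 8192),
}


def fold_change(mixture: float, hybrid: float) -> float:
    """Hybrid titre relative to the titre of the simple mixture."""
    return hybrid / mixture


FOLD = {antigen: fold_change(*pair) for antigen, pair in TITRES.items()}

for antigen, ratio in FOLD.items():
    print(f"{antigen}: hybrid/mixture = {ratio:g}")
```

On these figures the hybrid halves the titre for 919, leaves it unchanged for 953, and raises it 64-fold for ORF46.1.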

[0070] Data for bactericidal activity against heterologous MenB strains and against serotypes A and C were also obtained for 919-287 and 953-287:

                919                 953                 ORF46.1
Strain          Mixture   Hybrid    Mixture   Hybrid    Mixture   Hybrid
MC58            512       1024      512       1024      —         1024
NGH38           1024      2048      2048      4096      —         4096
BZ232           512       128       1024      16        —         —
MenA (F6124)    512       2048      2048      32        —         1024
MenC (C11)      >2048     n.d.      >2048     n.d.      —         n.d.
MenC (BZ133)    >4096     >8192     >4096     <16       —         2048

[0071] Hybrids of ORF46.1 and 919 were also constructed. Best results (four-fold higher titre) were achieved with 919 at the N-terminus.

[0072] Hybrids 919-519His, ORF97-225His and 225-ORF97His were also tested. These gave moderate ELISA titres and bactericidal antibody responses.

[0073] As hybrids of two proteins A & B may be either NH₂-A-B-COOH or NH₂-B-A-COOH, the “reverse” hybrids with 287 at the N-terminus were also made, but using ΔG287. A panel of strains was used, including homologous strain 2996. FCA was used as adjuvant:

                287 & 919              287 & 953              287 & ORF46.1
Strain          ΔG287-919   919-287    ΔG287-953   953-287    ΔG287-46.1   46.1-287
2996            128000      16000      65536       8192       16384        8192
BZ232           256         128        128         <4         <4           <4
1000            2048        <4         <4          <4         <4           <4
MC58            8192        1024       16384       1024       512          128
NGH38           32000       2048       >2048       4096       16384        4096
394/98          4096        32         256         128        128          16
MenA (F6124)    32000       2048       >2048       32         8192         1024
MenC (BZ133)    64000       >8192      >8192       <16        8192         2048

[0074] Better bactericidal titres are generally seen with 287 at the N-terminus.
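
The statement above can be checked against the titres in paragraph [0073] by counting, for each strain and partner, which orientation gives the higher titre. The sketch below is one possible tally, not the authors' analysis: censored values such as "<4" or ">2048" are treated as open bounds, and a pair is only scored when the comparison is unambiguous. All names are hypothetical.

```python
import math
from collections import Counter

# (deltaG287-X, X-287) pairs per strain, copied from paragraph [0073]
# in the order 2996, BZ232, 1000, MC58, NGH38, 394/98, F6124, BZ133.
TABLE = {
    "919": [("128000", "16000"), ("256", "128"), ("2048", "<4"), ("8192", "1024"),
            ("32000", "2048"), ("4096", "32"), ("32000", "2048"), ("64000", ">8192")],
    "953": [("65536", "8192"), ("128", "<4"), ("<4", "<4"), ("16384", "1024"),
            (">2048", "4096"), ("256", "128"), (">2048", "32"), (">8192", "<16")],
    "ORF46.1": [("16384", "8192"), ("<4", "<4"), ("<4", "<4"), ("512", "128"),
                ("16384", "4096"), ("128", "16"), ("8192", "1024"), ("8192", "2048")],
}


def bounds(titre: str) -> tuple:
    """Return (lower, upper) bounds for an exact or censored titre."""
    t = titre.strip()
    if t.startswith("<"):
        return (0.0, float(t[1:]))
    if t.startswith(">"):
        return (float(t[1:]), math.inf)
    value = float(t)
    return (value, value)


def compare(n_term: str, c_term: str) -> str:
    """Say which orientation is higher, or 'undetermined' if bounds overlap."""
    n_lo, n_hi = bounds(n_term)
    c_lo, c_hi = bounds(c_term)
    if n_lo > c_hi:
        return "N-terminal"
    if c_lo > n_hi:
        return "C-terminal"
    return "undetermined"  # ties and overlapping censored values


SUMMARY = Counter(compare(n, c) for pairs in TABLE.values() for n, c in pairs)
print(dict(SUMMARY))
```

Pairs such as ">2048" against "4096" are left undetermined rather than guessed, so the tally is conservative.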

[0075] When fused to protein 961 [NH₂-ΔG287-961-COOH, sequence shown above], the resulting protein is insoluble and must be denatured and renatured for purification. Following renaturation, around 50% of the protein was found to remain insoluble. The soluble and insoluble proteins were compared, and much better bactericidal titres were obtained with the soluble protein (FCA as adjuvant):

             2996     BZ232   MC58    NGH38    F6124    BZ133
Soluble      65536    128     4096    >2048    >2048    4096
Insoluble    8192     <4      <4      16       n.d.     n.d.

[0076] Titres with the insoluble form were, however, improved by using alum adjuvant instead:

             2996     BZ232   MC58    NGH38    F6124    BZ133
Insoluble    32768    128     4096    >2048    >2048    2048

[0077] 961c was also used in hybrid proteins (see above). As 961 and its domain variants direct efficient expression, they are ideally suited as the N-terminal portion of a hybrid protein.

EXAMPLE 23 Further Hybrids

[0078] Further hybrid proteins of the invention are shown in the drawings and have the sequences set out below. These are advantageous when compared to the individual proteins: ORF46.1-741. ATGTCAGATTTGGCAAACGATTCTTTTATCCGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCAC CTATTCGGCAGCAGGGGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGC AACCTGATGATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTC CATTCCCCCTTCGACAACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTAC CGCATCCATTGGGACGGATACGAACACCATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCC AAAGGCGCGAGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGC AGCACCGGACAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAA CGCGCCACCCGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTT AAAAACATCATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATT GCTGTCATGCACGGCTTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAA CTCAAAGACTATGCCGCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTC AGCAATATCTTTATGGCAGCCATCCCCATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACG GCACATCCTATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCC GATGCGGCATACGCCAAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAA GAAAACATCACCTCCTCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACA GGCGTACCGTTTGACGGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAATATGATACGGGATCCGGAGGGGGTGGT GTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAGTCT TTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGAAAC GGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTGGAC GGGCAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAGACC GAGCAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGCGAA CATACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCCGGC 
GGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAACTC AATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTACAAC CAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTGAAA ACCGTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGCACCACCACCACCACCACTGA 1 HSDLANDSFI RGVLDRWHFE PDGKYHLFGS RGELAERSGH IGLGKIGSHG 51 LGNLMIGGAA IKGNIGYIVR FSDHGHBVHS PFDNHASHSD SDEAGSPVDG 101 FSLYRIHWDG YEHHPADGYD GPGGGGYPAP KGARDIYSYD IKGVAGNIRL 151 NLTDNRSTGG RLADRFHNAG SMLTGGVGDG FKRATRYSPE LDRSGNAAFA 201 FNGTADIVKN IIGAAGEIVG AGDAVGGISE GSNIAVNHGL GLLSTENKMA 251 RINDLADMAG LKDYAAAAIR DWAVGNPNAA GGIRAVSNIF MAAIPIKGIG 301 AVRGKYGLGG ITAHPIKRSG MGAIALPKGK SAVSDNFADA AYAKYPSPYH 351 SBNIRSNLEG RYGKENITSS TVPPSNGKNV KLADGRHPKT GVPFDGKGFP 401 NFBKHVKYDT GSGGGGVAAD IGAGLADALT APLDHKDKGL GSLTLDGSVR 451 KNEKLKLAAG GAEKTYGNGD SLNTGKLKND KVSRPDFIRG IEVDGGLITL 501 ESGEFGVYKG SHSALTAPGT EGIGDSEHSG KMVAKRGFRI GDIAGEHTSF 551 DKLPEGGRAT YRGTAPGSDD AGGKLTYTID FAAKGGNGKI EHLKSPELNV 601 DLAAADIKPD GKRHAVISGS VLYNGAEKGS YSLGIFGGKA GEVAGSAEVE 651 TVNGIRHIGL AAKGLBHHHH HH* ORF46.1-961 ATGTCAGATTTGGCAAACGATTCTTTTATCCGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCAC CTATTCGGCAGCAGGGGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGC AACCTGATGATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTC CATTCCCCCTTCGACAACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTAC CGCATCCATTGGGACGGATACGAACACCATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCC AAAGGCGCGAGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGC AGCACCGGACAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAA CGCGCCACCCGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTT AAAAACATCATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATT GCTGTCATGCACGGCTTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAA 
CTCAAAGACTATGCCGCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTC AGCAATATCTTTATGGCAGCCATCCCCATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACG GCACATCCTATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCC GATGCGGCATACGCCAAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAA GAAAACATCACCTCCTCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACA GGCGTACCGTTTGACGGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAATATGATACGGGATCCGGAGGAGGAGGA GCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAAC GGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGAT GTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAA CAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCC GCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTT GCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCC GAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAA GCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCC GAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCT AAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATT GCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCA GAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCT GATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGC AAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCAGTC GGCACTTCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGCTCGAGCACCACCACCACCACCAC TGA 1 MSDLANDSFI RGVLDRGHFE PDGKYHLFGS RGELABRSGH IGLGKIGSHG 51 LGNLHIGGAA IKGNIGYIVR FSDHGHEVHS PFDNHASHSD SDEAGSPVDG 101 FSLYRIHWDG YEHHPADGYD GPGGGGYPAP KGARDIYSYD IKGVAGNIRL 151 NLTDNRSTGG RLADRFHNAG SMLTGGVGDG FKRATRYSPB LDRSGNAAEA 201 FNGTADIVKN IIGAAGEIVG AGDAVGGISE GSNIAVMHGL GLLSTENKMA 251 RINDLADNAG LKDYAAAAIR DWAVGNPNAA 
GGIEAVSNIF MAAIPIKGIG 301 AVRGKYGLGG ITAHPIKRSG MGAIMaPKGK SAVSDNFADA AYAKYPSPYH 351 SRNIRSNLBG RYGKENITSS TVPPSNGKNV KLADQRHPKT GVPFDGKGFP 401 NFEKHVKYDT GSGGGGATND DDVKKAATVA IAAAYNNGGE INGFKAGETI 451 YDIDEDGTIT KKDATAADVE ADDFKGLGLK KVVTNLTKTV NENKGNVDAK 501 VKAABSEIEK LTTKLADTDA ALADTDAALD ATTNALNKLG ENITTFAEET 551 KTNIVKIDEK LEAVADTVDK HAEAFNDIAD SLDETNTKAD EAVKTANEAK 601 GTAEETKGNV DAKVKAAETA AGKAEAAAGT ANTAADKAEA VAAKVTDIKA 651 DIATNKDNIA KKANSADVYT REESDSKFVR IDGLNATTEK LDTRLASAEK 701 SIADEDTRLN GLDKTVSDLR KETRGGLAEG AALSGLFGPY NVGRFNVTAA 751 VGGYKSESAV AIGTGFRFTE NFAAKAGVAV GTSSGSSAAY HVGVNYEWLE 801 HHHHHH* ORF46.1-961c ATGTCAGATTTGGCAAACGATTCTTTTATCCGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCAC CTATTCGGCAGCAGGGGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGC AACCTGATGATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTC CATTCCCCCTTCGACAACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTAC CGCATCCATTGGGACGGATACGAACACCATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCC AAAGGCGCGAGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGC AGCACCGGACAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAA CGCGCCACCCGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTT AAAAACATCATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATT GCTGTCATGCACGGCTTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAA CTCAAAGACTATGCCGCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGGCGTC AGCAATATCTTTATGGCAGCCATCCCCATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACG GCACATCCTATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCC GATGCGGCATACGCCAAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAA GAAAACATCACCTCCTCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACA GGCGTACCGTTTGACGGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAATATGATACGGGATCCGGAGGAGGAGGA GCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAAC 
GGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGAT GTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAA CAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCC GCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTT GCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCC GAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAA GCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCC GAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCT GATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGC AAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATT GCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCA GAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCTCGAGCACCACCACCACCACCACTGA 1 MSDLANDSFI RGVLDRGHPE PDGKYHLFGS RGELAERSGH IGLGKIGSHG 51 LGNLMIGGAA IKGNIGYIVR FSDHGHEVHS PFDNHASHSD SDEAGSPVDG 101 FSLYRIHWDG YEHHPADGYD GPGGGGYPAP KGARDIYSYD IKGVAGNIRL 151 NLTDNRSTGG RLADRFHNAG SMLTQGVGDG FKRATRYSPE LDRSGNAAEA 203 FNGTADIVKN IIGAAGEIVG AGDAVGGISE GSNIAVHHGL GLLSTENKMA 251 RINDLADMAG LKDYAAAAIR DWAVGNPNAA GGIEAVSNIF MAAIPIKGIG 301 AVRGKYGLGG ITAHPIKRSG NGAIALPKGK SAVSDNFADA AYAKYPSPYH 351 SRNIRSNLBG RYGKENITSS TVPPSNGKNV KLADQRHPKT GVPFDGKGFP 401 NFEKHVKYDT GSGGGGATND DDVKKAATVA IAAAYNNGGB INGFKAGETI 451 YDIDEDGTIT KKDATAADVE ADDFKGLGLK KVVTNLTKTV NENKGNVDAK 501 VKAABSEIEK LTTKLADTDA ALADTDAALD ATTNALNKLG ENITTFAEET 551 KTNIVKIDEK LEAVADTVDK HAEAFNDIAD SLDETNTKAD EAVKTANEAK 601 GTAEETKQNV DAKVKAAETA AGKABAAAGT ANTAADKAEA VAAKVTDIKA 651 DIATNKDNIA KKANSADVYT REESDSKFVR IDGLNATTEK LDTRLASAEK 701 SIADHDTELN GLDKTVSDLR KETRGGLAEG AALSGLFGPY NVGLEHHHHH 751 H* 961-ORF46.1 ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATC 
AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCC GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAAC AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGAT GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACA GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAA GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAAT GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGAC AGCAAATTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAAATCC ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTT GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGC TACAAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCA GTCGGCACTTCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGGGATCCGGAGGAGGAGGATCA GATTTGGCAAACGATTCTTTTATCCGGCAGGTTCTCGACCGTCAGCATTTGGAACCCGACGGGAAATACCACCTATTC GGCAGCAGGGGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGCAACCTG ATGATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTCCATTCC CCCTTCGACAACCATGGCTCACATTCCGATTCTGATGAAGCCGGTAGTCCGGTTGACGGATTTAGCGTTTACCGCATC CATTGGGACGGATACGAACACCATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCCAAAGGC GCGAGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGCAGCACC GGACAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAACGCGCC ACCCGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTTAAAAAC ATCATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGGATAAGCGAAGGGTCAAACATTGCTGTC ATGCACGGCTTGGGTGTGCTTTCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGGAACTCAAA GACTATGCCGCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCCCAATGCGGCACAAGGCATAGAAGCCGTCAGCAAT ATCTTTATGGCAGGCATCCCCATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACGGCACAT CCTATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCCGATGCG 
GCATACGCCAAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAAGAAAAC ATCACCTCCTCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACAGGCGTA CCGTTTGACGGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAATATGATACGCTCGAGCACCACCACCACCACCAC TGA 1 HATNDDDVKK AATVAIAAAY NNGGEINGFK AGETIYDIDE DGTITKKDAT 51 AADVEADDFK GLGLKKVVTN LTKTVNENKG NVDAKVKAAE SEIEKLTTKL 101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA 151 DTVDKHAEAP NDIADSLDET NTKADEAVKT ANEAKQTAEE TKQNVDAKVK 201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS 251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT 301 VSDLRKETRG GLAEGAALSG LFGPYNVGRF NVTAAVGGYK SESAVAIGTG 351 FRFTENFAAK AGVAVGTSSG SSAAYHVGVN YEWGSGGGGS DLANDSFIRG 401 VLDRGHFBPD GKYHLFGSRG ELAERSGHIG LGKIGSHGLG NLMIGGAAIK 451 GNIGYIVRFS DHGHEVHSPF DNHASHSDSD EAGSPVDGFS LYRIHWDGYE 501 HEPADGYDGP GGGGYPAPKG ARDIYSYDIK GVAGNIBLNL TDNRSTGGRL 551 ADRFHNAGSM LTGGVGDGFK EATRYSPELD RSGNAAEAFN GTADIVKNII 601 GAAGEIVGAG DAVGGISEGS NIAVMHGLGL LSTENKHARI NDLADMAGLK 651 DYAAAAIRDW AVGNPNAAGG IEAVSNIFNA AIPIKGIGAV RGKYGLGGIT 701 AHPIKRSGMG AIALPKGKSA VSDNFADAAY AKYPSPYHSR NIRSNLEGRY 751 GKENITSSTV PPSNGKNVKL ADGRHPKTGV PFDGKGFPNF EKHVKYDTLE 801 HHHHHH* 961-741 ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATC AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCC GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAAC AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGAT GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACA TTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCAT GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAAT GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAA GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAA GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGAC 
AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCC ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTT GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGC TACAAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCA GTCGGCACTTCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGGGATCCGGAGGGGGTGGTGTC GCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAGTCTTTG ACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGAAACGGT GACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTGGACGGG CAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAGACCGAG CAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTCTCAGAATCGGCGACATAGCGGGCGAACAT ACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCCGGCGGA AAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAACTCAAT GTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTACAACCAA GCCGAGAAAGGCAGTTACTCCTCGGTATCTTTGGCGGAAAAGCCCCAGGAAGTTGCCGGCAGCGCGGAAGTGAAAACC GTAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGCACCACCACCACCACCACTGA 1 MATNDDDVKK AATVAIAAAY NNGGBINGFK AGETIYDIDE DGTITKKDAT 51 AADVEADDFK GLGLKKVVTN LTKTVNENKG NVDAKVKAAE SEIEKLTTKL 101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDBKEHAVA 151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKGTAEE TKGNVDAKVK 201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS 251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT 301 VSDLRKBTRG GLAEGAALSG LFGPYNVGRF NVTAAVGGYK SESAVAIGTG 351 FRPTENFAAK AGVAVGTSSG SSAAYHVGVN YEWGSGGGGV AADIGAGLAD 401 ALTAPLDHKD KGLGSLTLDG SVRKNEKLKL AAGGAEKTYG NGDSLNTGKL 451 KNDKVSRFDF IRGIEVDGGL ITLESGEFGV YKGSHSALTA FGTEGIGDSE 501 HSGKMVAKRG FRIGDIAGEH TSFDKLPEGG RATYRGTAFG SDDAGGKLTY 551 TIDFAAKGGN GKIEHLKSPE LNVDLAAADI KPDGKEHAVI SGSVLYNGAE 601 KGSYSLGIFG GKAGEVAGSA EVKTVNGIRE IGLAARGLEH HHHHH* 961-983 
ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATC AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCC GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAAC AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGAT GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACA TTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCAT GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAAT GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAA GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAA GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGAC AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCC ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTT GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGC TACAAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAADCAGGCGTGGCA GTCGGCACTTCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGGGATCCGGCGGAGGCGGCACT TCTGCGCCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCAGCAGTA TCTTACGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCGGTTACA GACAGGGATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCATACAAG AATTTGATCAACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGACACAGGC GAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAAAACTAT ACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAGGCCGTT ATAGAGACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCATATTATT GGCGGGCGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACGAATGAT GAAACCAAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGCATCGTC AATAACAGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAGTACCGC 
CAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTACGGCAAC CTGTCCTACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCCAACACA TATGCCCTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGTGGAGAA AAGTTCAAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATTACTGCC ATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGAACATCC TTTTCCGCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAACCTGCGT ACCACGTTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTGGATGCG GGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGATATTGCC TACTCCTTCCGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCACGGCAAC AACACCTATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATGCGCGTC GAAACCAAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTATCTGGCA GATACCGACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACGCTGTAC ACACGTTTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGCGGCAAG GGGGCAGGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTATTCTTTC TTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAAGGCGAC ACGCTGTCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCCGGTCTG AAACACGCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCCGCAACA CCCGAGACGGTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTCCGCGCA GCGGCAGCCGTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTATGCCGAC AGTACCGCCGCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGCACGGGT CTGCGCGTCATCGCGCAAACCCAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGCGGCAGT ACCCAAACCGTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGCAGCACA TGGAGCGAAAACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGCGATATC GGCTATCTCAAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAACATGCG GAAGGCAGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACGGGAGAT 
TTGACGGTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTGGGCTGG AGCGGCAACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGATAAAGCC GTCCTGTTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACCGGCGCG ACTGCAGCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGATGTCGAA TTCGGCAACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGACGAGTC GGCGTAGGCTACCGGTTCCTCGAGCACCACCACCACCACCACTGA 1 HATNDDDVKK AATVAIAAAY NNGGEINGFK AGETIYDIDE DGTITKKDAT 51 AADVEADDFK GLGLKKVVTN LTKTVNENKG NVDAKVKAAE SEIEKLTTKL 101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA 151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKGTAEE TKGNVDAKVK 201 AAETAAGKAE AAAGTANTAA DKAEAVAARV TDIKADIATN KDNIAKKANS 251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT 301 VSDLRKETRG GLAEQAALSG LFQPYNVGRF NVTAAVGGYK SESAVAIGTG 351 FRFTENFAAR AGVAVGTSSG SSAAYHVGVN YEWGSGGGGT SAPDFNAGGT 401 GIGSNSRATT AKSAAVSYAG IKNEMCKDRS MLCAGRDDVA VTDRDAKINA 451 PPPNLHTGDF PNPNDAYKNL INLKPAIEAG YTGRGVEVGI VDTGESVGSI 501 SFPELYGRKE HGYNENYKNY TAYMRKEAPE DGGGKDIEAS FDDEAVIETE 551 AKPTDIRHVK EIGHIDLVSH IIGGRSVDGR PAGGIABDAT LHIMNTNDET 601 KNEMMVAAIR NAWVKLGERG VRIVNNSFGT TSRAGTADLF GIANSEEGYR 651 QALLDYSGGD KTDEGIRIMG GSDYGNLSYH IRNKNNLFIF STGNDAGAGP 701 NTYALLPFYE KDAGKGIITV AGVDRSGEKF KPEMYGEPGT EPLEYGSNHC 751 GITAMWCLSA PYEASVRFTR TNPIGIAGTS FSAPIVTGTA ALLLGKYPWM 801 SNDNLRTTLL TTAGDIGAVG VDSKFGWGLL DAGKAMNGPA SFPFGDFTAD 851 TKGTSDIAYS FRNDISGTGG LIKKGGSGLG LHGNNTYTGK TIIEGGSLVL 901 YGNNKSDMRV ETKGALIYNG AASGGSLNSD GIVYLADTDG SGANETVHIK 951 GSLGLDGKGT LYTRLGKLLK VDGTAIIGGK LYMSARGKGA GYLNSTGRRV 1001 PFLSAAKIGG DYSFFTNIET DGGLLASLDS VEKTAGSEGD TLSYYVRRGN 1051 AARTASAAAH SAPAGLKHAV EQGGSNLENL HVELDASESS ATPETVETAA 1101 ADRTDMPGIR PYGATFRAAA AVGHANAADG VRIFNSLAAT VYADSTAAHA 1151 DMGGRRLKAV SDGLDHNGTG LRVIAGTGGD GGTWEGGGVE GKMRGSTGTV 1201 GIAAKTGENT TAAATLGMGR STWSENSANA KTDSISLFAG IRHDAGDIGY 1251 LKGLFSYGRY KNSISRSTGA DEHAEGSVNG TLMGLGALGG VNVPFAATGD 1301 LTVEGGLRYD 
LLKGDAFAEK GSALGWSGNS LTEGTLVGLA GLKLSGPLSD 1351 KAVLFATAGV ERDLNGRDYT VTGGFTGATA ATGKTGARNM PHTRLVAGLG 1401 ADVEFGNGWN GLARYSYAGS RQYGNHSGRV GVGYRFLEHH HHHH* 961c-ORF46.1 ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATC AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCC GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAAC AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGAT GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACA TTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCAT GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAAT GAAGCCAAACAGACGGCCGAAGAAACCAAACTAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAA GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAT GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGAC AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCC ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTT GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCCGGAGGAGGAGGATCAGATTTGGCA AACGATTCTTTTATCCGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCACCTATTCGGCAGCAGG GGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGCAACCTGATGATTCAA CAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTCCATTCCCCCTTCGAC AACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTACCGCATCCATTGGGAC GGATACGAACACCATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCCAAAGGCGCGAGGGAT ATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGCAGCACCGGACAACGG CTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAACGCGCCACCCGATAC AGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTTAAAAACATCATCGGC GCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATTGCTGTCATGCACGGC TTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAACTCAAAGACTATGCC 
GCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTCAGCAATATCTTTATG GCAGCCATCCCCATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACGGCACATCCTATCAAG CGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCCGATGCGGCATACGCC AAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAAGAAAACATCACCTCC TCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACAGGCGTACCGTTTGAC GGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAATATGATACGCTCGAGCACCACCACCACCACCACTGA 1 HATNDDDVKK AATVAIAAAY NNGGEINGFK AGETIYDIDE DGTITKKDAT 51 AADVEADDFK GLGLKKVVTN LTKTVNENKG NVDAKVKAAE SEIEKLTTKL 101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA 151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKGTAEE TKGNVDAKVK 201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNThKKANS 251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT 301 VSDLRKETRG GLAEGAALSG LFQPYNVGGS GGGGSDLAND SFIRQVLDRG 351 HFEPDGKYHL FGSRGELAER SGHIGLGKIG SHQLGNLMIG GAAIKGNIGY 401 IVRFSDHGHE VHSPFDNHAS HSDSDEAGSP VDGFSLYRIH WDGYEHHPAD 451 GYDGPGGGGY PAPKGARDIY SYDIKGVAGN IELNLTDNRS TGGELADRFH 501 NAGSMLTGGV GDGFKRATRY SPELDRSGNA AEAFNGTADI VKNIIGAAGE 551 IVGAGDAVGG ISEGSNIAVM HGLGLLSTEN KMARINDLAD MAGLKDYAAA 601 AIRDWAVGNP NAAGGIEAVS NIFMAAIPIK GIGAVRGKYG LGGITAHPIK 651 RSGMGAIALP KGKSAVSDNF ADAAYAKYPS PYHSRNIRSN LEGRYGKENI 701 TSSTVPPSNG KNVKLADGRH PKTGVPFDGK GFPNFEKHVK YDTLEHHHHH 751 H* 961c-741 ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATC AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCC GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAAC AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGAT GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACA TTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCAT GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAAT GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAA 
GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAA GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGAC AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCC ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTT GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCCGGAGGGGGTGGTGTCGCCGCCGAC ATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAGTCTTTGACGCTGGAT CAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGAAACGGTGACAGCCTC AATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTGGACGGGCAGCTCATT ACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAGACCGAGCAAATACAA GATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGCGAACATACATCTTTT GACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCCGGCGGAAAACTGACC TACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAACTCAATGTCGACCTG GCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTACAACCAAGCCGAGAAA GGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTGAAAACCGTAAACGGC ATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGCACCACCACCACCACCACTGA 1 MATNDDDVKK AATVAIAAAY NNGGEINGFK AGETIYDIDE DGTITKKDAT 51 AADVEADDFK GLGLKKVVTN LTKTVNENKQ NVDAKVKAAE SEIEKLTTKL 101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDERLEAVA 151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKGTAEE TKGNVDAKVK 201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS 251 ADVYTREESD SKFVRIDGLN ATTEKLDTEL ASAEKSIADH DTRLNGLDKT 301 VSDLEREThG GLAEGAALSG LFQPYNVGGS GGGGVAADIG AGLADALTAP 351 LDHKDKGLGS LTLDGSVRKN EKLKLAAGGA EKTYGNGDSL NTGKLKNDKV 401 SRPDFIRGIE VDGGLITLES GEFGVYKGSH SALTAPGTEG IQDSEHSGKM 451 VAKRGFRIGD IAGEHTSFDK LPEGGRATYR GTAFGSDDAG GKLTYTIDFA 501 AKGGNGKIEH LKSPELNVDL AAADIKPDGK REAVISGSVL YNGAEKGSYS 551 LGIFGGKAGE VAGSAEVKTV NGIRHIGLAA KGLEHHHHHH * 961c-983 ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATC 
AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCC GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAAC AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGAT GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACA TTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCAT GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAAT GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAA GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAA GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGAC AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCC ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTT GCAGAACAAGCCGCGCTCTCCCGTCTGTTCCAACCTTACAACGTGGGTGGATCCGGCGGAGGCGGCACTTCTGCGCCC GACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCAGCAGTATCTTACGCC GGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCGGTTACAGACAGGGAT GCCAAATCAATGCCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCATACAAGAATTTGATC AACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGACACAGGCGAATCCGTC GGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAAAACTATACGGCGTAT ATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAGGCCGTTATAGAGACT GAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCATATTATTGGCGGGCGT TCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACGAATGATGAAACCAAG AACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGCATCGTCAATAACAGT TTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAGTACCGCCAAGCGTTG CTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTACGGCAACCTGTCCTAC CACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCCAACACATATGCCCTA TTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGTGGAGAAAAGTTCAAA 
CGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATTACTGCCATGTGGTGC CTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGAACATCCTTTTCCGCA CCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAACCTGCGTACCACGTTG CTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTGGATGCGGGTAAGGCC ATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGATATTGCCTACTCCTTC CGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCACGGCAACAACACCTAT ACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATGCGCGTCGAAACCAAA GGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTATCTGGCAGATACCGAC CAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACGCTGTACACACGTTTG GGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGCGGCAAGGGGGCAGGC TATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTATTCTTTCTTCACAAAC ATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAAGGCGACACGCTGTCC TATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCCGGTCTGAAACACGCC GTAGAACAGGGCGGCAGCTATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCCGCAACACCCGAGACG GTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTCCGCGCAGCGGCAGCC GTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTATGCCGACAGTACCGCC GCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGCACGGGTCTGCGCGTC ATCGCGCAAACCCAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGCGGCAGTACCCAAACC GTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGCAGCACATGGAGCGAA AACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGCGATATCGGCTATCTC AAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAACATGCGGAAGGCAGC GTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACGGGAGATTTGACGGTC GAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTGGGCTGGAGCGGCAAC AGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGATAAAGCCGTCCTGTTT GCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACCGGCGCGACTGCAGCA 
ACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGATGTCGAATTCGGCAAC GGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGACGAGTCGGCGTAGGC TACCGGTTCCTCGAGCACCACCACCACCACCACTGA 1 MATNDDDVKK AATVAIAAAY NNGGEINGFK AGETIYDIDE DGTITKKDAT 51 AADVEADDFK GLGLKKVVTN LTKTVNENKG NVDAKVKAAE SEIEKLTTKL 101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA 151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKGTAEE TKGNVDAKVK 201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS 251 ADVYTREESD SKFVRIDGLN ATTEKLDTBL ASAEKSIADH DTRLNGLDKT 301 VSDLRKETRG GLAEGAALSG LFGPYNVGGS GGGGTSAPDF NAGGTGIGSN 351 SRATTAKSAA VSYAGIKNEM CKDRSHLCAG RDDVAVTDRD AKINAPPPNL 401 HTGDFPNPND AYKNLINLKP AIEAGYTGRG VEVGIVDTGE SVGSISFPEL 451 YGRKEHGYNB NYKNYTAYMR KEAPEDGGGK DIEASFDDEA VIETEAKPTD 501 IRHVKEIGHI DLVSHIIGGR SVDGRPAGGI APDATLHIMN TNDETKNEMM 551 VAAIRNAWVK LGERGVRIVN NSFGTTSRAG TADLFGIANS EEGYRGALLD 601 YSGGDKTDEG IRLMGGSDYG NLSYHIRNKN MLFIFSTGND AGAGPNTYAL 651 LPFYEKDAGK GIITVAGVDR SGEKFKRBHY GEPGTEPLEY GSNHCGITAM 701 WCLSAPYEAS VRFTRTNPIG IAGTSFSAPI VTGTAALLLG KYPWMSNDNL 751 RTTLLTTAGD IGAVGVDSKF GWGLLDAGKA MNGPASFPPG DFTADTKGTS 801 DIAYSFRNDI SGTGGLIKKG GSGLGLHGNN TYTGKTIIEG GSLVLYGNNK 851 SDNRVETKGA LIYNGAASGG SLNSDGIVYL ADTDGSGANE TVHIKGSLGL 901 DGKGTLYTRL GKLLKVDGTA IIGGKLYMSA RGKGAGYLNS TGRRVPFLSA 951 AKIGGDYSFF TNIETDGGLL ASLDSVEKTA GSBGDTLSYY VRRGNAARTA 1001 SAAAHSAPAG LKHAVEGGGS NLENLMVELD ASESSATPET VETAAADRTD 1051 MPGIRPYGAT FRAAAAVGHA NAADGVRIFN SLAATVYADS TAAHADMGGR 1101 RLKAVSDGLD HNGTGLRVIA GTGGDGGTWE GGGVEGKMRG STGTVGIAAK 1151 TGENTTAAAT LGMGRSTWSE NSANAKTDSI SLFAGIRHDA GDIGYLKGLF 1201 SYGRYRNSIS RSTGADEHAE GSVNGTLMGL GALGGVNVPF AATGDLTVEG 1251 GLRYDLLKGD AFAEKGSALG WSGNSLTEGT LVGLAGLKLS GPLSDKAVLF 1301 ATAGVEEDLN GEDYTVTGGF TGATAATGKT GAPNMPHTRL VAGLGADVEF 1351 GNGWNGLARY SYAGSKGYGN HSGRVGVGYR FLEHHHHHH* 961cL-ORF46.1 ATGAAACACTTTCCATCCAAAGTACTGACCACAGCCATCCTTGCCACTTTCTGTAGCGGCGCACTGGCAGCCACAAAC 
GACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTCAAA GCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAAGCC GACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAACGTC GATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTAGCA GATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAAGAG ACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCATTC AACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAACAG ACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCC GCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATCGCT ACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTTGTC AGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGATCAC GATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAAGCC GCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCCGGAGGAGGAGGATCAGATTTGGCAAACGATTCTTTT ATCCGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATACCACCTATTCGGCAGCAGGGGGGAACTTGCC GAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTTGGGCAACCTGATGATTCAACAGGCGGCCATT AAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTCCATTCCCCCTTCGACAACCATGCCTCA CATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTTAGCCTTTACCGCATCCATTGGGACGGATACGAACAC CATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCCAAAGGCGCGAGGGATATATACAGCTAC GACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGCAGCACCGGACAACGGCTTGCCGACCGT TTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAACGCGCCACCCGATACAGCCCCGAGCTG GACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTTAAAAACATCATCGGCGCGGCAGGAGAA ATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATTGCTGTCATGCACGGCTTGGGTCTGCTT TCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAACTCAAAGACTATGCCGCAGCAGCCATC CGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTCAGCAATATCTTTATGGCAGCCATCCCC ATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATCACGGCACATCCTATCAAGCGGTCGCAGATG 
GGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCCGATGCGGCATACGCCAAATACCCGTCC CCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAAGAAAACATCACCTCCTCAACCGTGCCG CCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACAGGCGTACCGTTTGACGGTAAAGGGTTT CCGAATTTTGAGAAGCACGTGAAATATGATACGTAACTCGAG 1 MKHFPSKVLT TAILATFCSG ALAATNDDDV KKAATVAIAA AYNNGGEING 51 FKAGETIYDI DEDGTITKKD ATAADVEADD FKGLGLKKVV TNLTKTVNEN 101 KGNVDAKVKA AESEIEKLTT KLADTDAALA DTDAALDATT NALNKLGENI 151 TTFAEETKTN IVKIDEKLEA VADTVDKHAE APNDIADSLD ETNTKADEAV 201 KThNEAKGTA EETKGNVDAK VKAAETAAGK AEAAAGTANT AADKAEAVAA 251 KVTDIKADIA TNKDNIAKKA NSADVYTREE SDSKFVRIDG LNATTEKLDT 301 RLASAEKSIA DHDThLNGLD KTVSDLRKBT RQGLAEQAAL SGLPGPYNVG 351 GSGGGGSDLA NDSFIRGVLD RGHFEPDGKY HLPGSRGELA ERSGHIGLGK 401 IGSHGLGNLM IQQAAIKGNI GYIVRFSDHG HEVHSPFDNH ASHSDSDHAG 451 SPVDGFSLYR IHWDGYBHHP ADGYDGPGGG GYPAPKGARD IYSYDIKGVA 501 GNIRLNLTDN RSTGQRLADR FHNAGSMLTQ GVGDGFKRAT RYSPELDRSG 551 NAAEAFNGTA DIVKNIIGAA GEIVGAGDAV GGISEGSNIA VMHGLGLLST 601 ENKMARINDL ADMAGLKDYA AAAIRDWAVG NPNAAGGIEA VSNIFMAAIP 651 IKGIGAVRGK YGLGGITAHP IKRSGMGAIA LPKGKSAVSD NFADAAYAKY 701 PSPYHSRNIR SNLEGRYGKE NITSSTVPPS NGKNVKLADG RHPKTGVPFD 751 GKGFPNFEKH VKYDT* 961cL-741 ATGAAACACTTTCCATCCAAAGTACTGACCACAGCCATCCTTGCCACTTTCTGTAGCGGCGCACTGGCAGCCACAAAC GACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTCAAA GCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAAGCC GACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAACGTC GATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTAGCA GATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAAGAG ACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCATTC AACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAACAG ACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCC GCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATCGCT 
ACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTTGTC AGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGATCAC GATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAAGCC GCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCCGGAGGGGGTGGTGTCGCCGCCGACATCGGTGCGGGG CTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAGTCTTTGACGCTGGATCAGTCCGTCAGG AAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGAAACGGTGACAGCCTCAATACGGGCAAA TTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTGGACGGGCAGCTCATTACCTTGGAGAGT GGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAGACCGAGCAAATACAAGATTCGGAGCAT TCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGCGAACATACATCTTTTGACAAGCTTCCC GAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCCGGCGGAAAACTGACCTACACCATAGAT TTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAACTCAATGTCGACCTGGCCGCCGCCGAT ATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCTTTACAACCAAGCCGAGAAAGGCAGTTACTCC CTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTGAAAACCGTAAACGGCATACGCCATATC GGCCTTGCCGCCAAGCAACTCGAGCACCACCACCACCACCACTGA 1 MKHFPSKVLT TAILATFCSG ALAATNDDDV KKAATVAIAA AYNNGGEING 51 FKAGETIYDI DEDGTITKKD ATAADVEADD FKGLGLKKVV TNLTKTVNEN 101 KGNVDAKVKA AESEIEKLTT KLADTDAALA DTDAALDATT NALNKLGENI 151 TTFAEETKTN IVKIDEKLEA VADTVDKHAB AFNDIADSLD ETNTKADEAV 201 KTANEAKGTA EETKGNVDAK VKAAETAAGK AEAAAGTANT AADKAEAVAA 251 KVTDIKADIA TNKDNLAKKA NSADVYTREE SDSKFVRIDG LNATTEKLDT 301 RLASABKSIA DHDTRLNGLD KTVSDLRKET RQGLAEQAAL SGLFGPYNVG 351 GSGGGGVAAD IGAGLADALT APLDHKDKGL GSLTLDGSVR KNEKLKLAAG 401 GAEKTYGNGD SLNTGKLKND KVSRFDFIRG IEVDGGLITL ESGEFGVYKG 451 SHSALTAPGT EGIGDSEHSG KMVAKRGFRI GDIAGEHTSF DKLPEGGRAT 501 YRGTAPGSDD AGGKLTYTID FAAKGGNGKI EHLKSPELNV DLAAADIKPD 551 GKRHAVISGS VLYNGAEKGS YSLGIFGGKA GEVAGSAEVK TVNGIRHIGL 601 AAKGLEHHHH HH* 961cL-983 ATGAAACACTTTCCATCCAAAGTACTGACCACAGCCATCCTTGCCACTTTCTGTAGCGGCGCACTGGCAGCCACAAAC GACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCCAAGAAATCAACGGTTTCAAA 
GCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGCAGCCGATGTTGAAGCC GACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAAACAAAACGTC GATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATGCCGCTTTAGCA GATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAAATATAACGACATTTGCTGAAGAG ACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGCCGAAGCATTC AACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCAAACAG ACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCC GCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATCGCT ACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGCAAATTTGTC AGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCATTGCCGATCAC GATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCAGAACAAGCC GCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCCGGCGGAGGCGGCACTTCTGCGCCCGACTTCAATGCA GGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCAGCAGTATCTTACGCCGGTATCAAGAAC GAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTTGCGGTTACAGACAGGGATGCCAAAATCAAT GCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCATACAAGAATTTGATCAACCTCAAACCT GCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGACACAGGCGAATCCGTCGGCAGCATATCC TTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAAAACTATACGGCGTATATGCGGAAGGAA GCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTCGACGATGAGGCCGTTATAGAGACTGAAGCAAAGCCG ACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCATATTATTGGCGGGCGTTCCGTGGACGGC AGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAATACGAATGATGAAACCAAGAACGAAATGATG GTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGAACGTGGCGTGCGCATCGTCAATAACAGTTTTGGAACAACA TCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATTCGGAGGAGCAGTACCGCCAAGCGTTGCTCGACTATTCC GGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTACGGCAACCTGTCCTACCACATCCGTAAT AAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACAAGCTCAGCCCAACACATATGCCCTATTGCCATTTTAT GAAAAAGACGCTCAAAAAGGCATTATCACAGTCGCAGGCGTAGACCGCAGTGGAGAAAAGTTCAAACGGGAAATGTAT 
GGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATTACTGCCATGTGGTGCCTGTCGGCACCC TATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGAACATCCTTTTCCGCACCCATCGTAACC GGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAACCTGCGTACCACGTTGCTGACGACGGCT CAGGACATCGGTGCAGTCGGCGTGGACAGCAAGTTCGGCTGGGGACTGCTGGATGCGGGTAAGGCCATGAACGGACCC GCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGATATTGCCTACTCCTTCCGTAACGACATT TCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAACTGCACGGCAACAACACCTATACGGGCAAAACC ATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCGGATATGCGCGTCGAAACCAAAGGTGCGCTGATT TATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGAGGGCATTGTCTATCTGGCAGATACCGACCAATCCGGCGCA AACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACGCTGTACACACGTTTGGGCAAACTGCTG AAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGCGGCAAGGGGGCAGGCTATCTCAACAGT ACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATTATTCTTTCTTCACAAACATCGAAACCGAC GGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAAGGCGACACGCTGTCCTATTATGTCCGT CGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCCGGTCTGAAACACGCCGTAGAACAGGGC GGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAATCATCCGCAACACCCGAGACGGTTGAAACTGCG GCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTCCGCGCAGCGGCAGCCGTACAGCATGCG AATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTATGCCGACAGTACCGCCGCCCATGCCGAT ATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCACAACGGCACGGGTCTGCGCGTCATCGCGCAAACC CAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGCGGCAGTACCCAAACCGTCGGCATTGCC GCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGCAGCACATGGAGCGAAAACAGTGCAAAT GCAAAAACCGACAGCATTAGTCTGTTTGCAGGCATACGGCACGATGCGGGCGATATCGGCTATCTCAAAGGCCTGTTC TCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAACATGCGGAAGGCAGCGTCAACGGCACG CTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACGGGAGATTTGACGGTCGAAGGCGGTCTG CGCTACGACCTGCTCAAACAGGATGCATTCGCCGAAAAAGGCAGTGCTTTGGGCTGGAGCGGCAACAGCCTCACTGAA GGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGAGCGATAAAGCCGTCCTGTTTGCAACGGCGGGC GTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACCGGCGCGACTGCAGCAACCGGCAAGACG 
GGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGATGTCGAATTCGGCAACGGCTGGAACGGC TTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGACGAGTCGGCGTAGGCTACCGGTTCTGA CTCGAG 1 MKHFPSKVLT TAILATFCSG MJAATNDDDV KKAATVAIAA AYNNGGEING 51 FKAGETIYDI DEDGTITKKD ATAADVEADD FKGLGLKKVV TNLTKTVNEN 101 KGNVDAKVKA AESEIEKLTT KLADTDAALA DTDAALDATT NALNKLGENI 151 TTFAEETKTN IVKIDEKLEA VADTVDKHAB AFNDIADSLD ETNTKADEAV 201 KTANEAKGTA EETKGNVDAK VKAAETAAGK AEAAAGTANT AADKAEAVAA 251 KVTDIKADIA TNKDNIAKKA NSADVYTREE SDSKFVRIDG LNATTEKLDT 301 RLASAEKSIA DHDTRLNGLD KTVSDLRKET RGGLAEGAAL SGLFGPYNVG 351 GSGGGGTSAP DFNAGGTGIG SNSRATTAKS AAVSYAGIKN EMCKDRSMLC 401 AGRDDVAVTD RDAKINAPPP NLHTGDFPNP NDAYKNLINL KPAIEAGYTG 451 RGVEVGIVDT GESVGSISFP ELYGRKEHGY NENYKNYTAY MRKEAPEDGG 501 GKDIEASFDD EAVIETEAKP TDIRHVKEIG HIDLVSHIIG GRSVDGRPAG 551 GIAPDATLHI MNTNDETKNE MMVAAIRNAW VKLGERGVRI VNNSFGTTSR 601 AGTADLPGIA NSEEGYRGAL LDYSGGDKTD EGIRLHGGSD YGNLSYHIRN 651 KNMLFIFSTG NDAGAGPNTY ALLPFYEKDA GKGIITVAGV DRSGEKFKRE 701 MYGEPGTEPL EYGSNHCGIT AMWCLSAPYE ASVRPTRTNP IGIAGTSFSA 751 PIVTGTAALL LGKYPWMSND NLRTTLLTTA GDIGAVGVDS KFGWGLLDAG 801 KAMNGPASFF FGDFTADTKG TSDIAYSFRN DISGTGGLIK KGGSGLGLHG 851 NNTYTGKTII EGGSLVLYGN NKSDMRVETK GALIYNGAAS GGSLNSDGIV 901 YLADTDGSGA NETVHIKGSL GLDGKGTLYT RLGKLLKVDG TAIIGGKLYM 951 SARGKGAGYL NSTGRRVPFL SAAKIGQDYS FFTNIETDGG LLASLDSVEK 1001 TAGSEGDTLS YYVRRGNAAR TASAAAHSAP AGLKHAVEGG GSNLENLMVE 1051 LDASESSATP ETVETAAADR TDMPGIRPYG ATFRAAAAVG HANAADGVRI 1101 FNSLAATVYA DSTAAHADMG GRRLKAVSDG LDHNGTGLRV IAGTGGDGGT 1151 WEGGGVEGKM RGSTGTVGIA AKTGENTTAA ATLGMGRSTW SENSANAKTD 1201 SISLFAGIRH DAGDIGYLKG LFSYGRYKNS ISRSTGADEH AEGSVNGTLM 1251 GLGALGGVNV PFAATGDLTV EGGLRYDLLK GDAFAEKGSA LGWSGNSLTB 1301 GTLVGLAGLK LSGPLSDKAV LFATAGVERD LNGRDYTVTG GFTGATAATG 1351 KTGAR~MPRT RLVAGLGADV EFGNGWNGLA RYSYAGSKGY GNHSGRVGVG 1401 YRF*

[0079] It will be understood that the invention has been described by way of example only and modifications may be made whilst remaining within the scope and spirit of the invention. For instance, the use of proteins from other strains is envisaged [e.g. see WO00/66741 for polymorphic sequences for ORF4, ORF40, ORF46, 225, 235, 287, 519, 726, 919 and 953].

[0080] Experimental Details

[0081] Cloning Strategy and Oligonucleotide Design

[0082] Genes coding for antigens of interest were amplified by PCR, using oligonucleotides designed on the basis of the genomic sequence of N. meningitidis B MC58. Unless otherwise specified, genomic DNA from strain 2996 was used as the template in PCR reactions, and the amplified fragments were cloned in the expression vector pET-21b+ (Novagen) to express the protein as a C-terminal His-tagged product, or in pET-24b+ (Novagen) to express the protein in ‘untagged’ form (e.g. ΔG 287K).

[0083] Where a protein was expressed without a fusion partner and with its own leader peptide (if present), amplification of the open reading frame (ATG to STOP codons) was performed.

[0084] Where a protein was expressed in ‘untagged’ form, the leader peptide was omitted by designing the 5′-end amplification primer downstream from the predicted leader sequence.

[0085] The melting temperature of the primers used in PCR depended on the number and type of hybridising nucleotides in the whole primer, and was determined using the formulae:

T_(m1) = 4 (G + C) + 2 (A + T) (tail excluded)

T_(m2) = 64.9 + 0.41 (% GC) − 600/N (whole primer)

[0086] The melting temperatures of the selected oligonucleotides were usually 65-70° C. for the whole oligo and 50-60° C. for the hybridising region alone.
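The two formulae above can be illustrated with a short Python sketch. It is not part of the original protocol: the function names are hypothetical, and it assumes that the hyphen written inside a primer sequence separates the restriction-enzyme tail from the hybridising region, and that N is the length of the whole primer in nucleotides.

```python
def tm1_hybridising(hybridising_region: str) -> float:
    """T_m1 = 4(G + C) + 2(A + T), computed on the hybridising region only."""
    s = hybridising_region.upper()
    gc = s.count("G") + s.count("C")
    at = s.count("A") + s.count("T")
    return 4 * gc + 2 * at


def tm2_whole(primer: str) -> float:
    """T_m2 = 64.9 + 0.41(%GC) - 600/N, computed on the whole primer."""
    s = primer.upper().replace("-", "")  # drop the tail/hybridising separator
    n = len(s)
    gc_percent = 100.0 * (s.count("G") + s.count("C")) / n
    return 64.9 + 0.41 * gc_percent - 600.0 / n


def primer_tms(primer: str) -> tuple[float, float]:
    """Return (T_m1, T_m2) for a primer written as 'TAIL-HYBRIDISING'.

    Assumption: the hyphen marks the end of the restriction tail.
    """
    _tail, hybridising = primer.split("-", 1)
    return tm1_hybridising(hybridising), tm2_whole(primer)


if __name__ == "__main__":
    # Forward primer listed for the 961-741(MC58) fusion
    tm1, tm2 = primer_tms("CGCGGATCC-GGAGGGGGTGGTGTCG")
    print(f"T_m1 = {tm1:.1f} C, T_m2 = {tm2:.2f} C")
```

For this primer the hybridising region gives a T_m1 of 56° C., which falls inside the 50-60° C. range quoted above.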

[0087] Oligonucleotides were synthesised using a Perkin Elmer 394 DNA/RNA Synthesizer, eluted from the columns in 2.0 ml NH₄OH, and deprotected by 5 hours incubation at 56° C. The oligos were precipitated by addition of 0.3M Na-Acetate and 2 volumes ethanol. The samples were centrifuged and the pellets resuspended in water.

Construct | Primer | Sequence | Restriction site
fu (961)-741(MC58)-His | Fwd | CGCGGATCC-GGAGGGGGTGGTGTCG | BamHI
fu (961)-741(MC58)-His | Rev | CCCGCTCGAG-TTGCTTGGCGGCAAGGC | XhoI
fu (961)-983-His | Fwd | CGCGGATCC-GGCGGAGGCGGCACTT | BamHI
fu (961)-983-His | Rev | CCCGCTCGAG-GAACCGGTAGCCTACG | XhoI
fu (961)-Orf46.1-His | Fwd | CGCGGATCCGGTGGTGGTGGT-TCAGATTTGGCAAACGATTC | BamHI
fu (961)-Orf46.1-His | Rev | CCCGCTCGAG-CGTATCATATTTCACGTGC | XhoI
fu (961c-L)-741(MC58) | Fwd | CGCGGATCC-GGAGGGGGTGGTGTCG | BamHI
fu (961c-L)-741(MC58) | Rev | CCCGCTCGAG-TTATTGCTTGGCGGCAAG | XhoI
fu (961c-L)-983 | Fwd | CGCGGATCC-GGCGGAGGCGGCACTT | BamHI
fu (961c-L)-983 | Rev | CCCGCTCGAG-TCAGAACCGGTAGCCTAC | XhoI
fu (961c-L)-Orf46.1 | Fwd | CGCGGATCCGGTGGTGGTGGT-TCAGATTTGGCAAACGATTC | BamHI
fu (961c-L)-Orf46.1 | Rev | CCCGCTCGAG-TTACGTATCATATTTCACGTGC | XhoI
fu-(ΔG287)-919-His | Fwd | CGCGGATCCGGTGGTGGTGGT-CAAAGCAAGAGCATCCAAACC | BamHI
fu-(ΔG287)-919-His | Rev | CCCAAGCTT-TTCGGGCGGTATTCGGGCTTC | HindIII
fu-(ΔG287)-953-His | Fwd | CGCGGATCCGGTGGTGGTGGT-GCCACCTACAAAGTGGAC | BamHI
fu-(ΔG287)-953-His | Rev | GCCCAAGCTT-TTGTTTGGCTGCCTCGAT | HindIII
fu-(ΔG287)-961-His | Fwd | CGCGGATCCGGTGGTGGTGGT-ACAAGCGACGACG | BamHI
fu-(ΔG287)-961-His | Rev | GCCCAAGCTT-CCACTCGTAATTGACGCC | HindIII
fu-(ΔG287)-Orf46.1-His | Fwd | CGCGGATCCGGTGGTGGTGGT-TCAGATTTGGCAAACGATTC | BamHI
fu-(ΔG287)-Orf46.1-His | Rev | CCCAAGCTT-CGTATCATATTTCACGTGC | HindIII
fu-(ΔG287-919)-Orf46.1-His | Fwd | CCCAAGCTTGGTGGTGGTGGTGGT-TCAGATTTGGCAAACGATTC | HindIII
fu-(ΔG287-919)-Orf46.1-His | Rev | CCCGCTCGAG-CGTATCATATTTCACGTGC | XhoI
fu-(ΔG287-Orf46.1)-919-His | Fwd | CCCAAGCTTGGTGGTGGTGGTGGT-CAAAGCAAGAGCATCCAAACC | HindIII
fu-(ΔG287-Orf46.1)-919-His | Rev | CCCGCTCGAG-CGGGCGGTATTCGGGCTT | XhoI
fu ΔG287(394.98)- . . . | Fwd | CGCGGATCCGCTAGC-CCCGATGTTAAATCGGC | NheI
fu ΔG287(394.98)- . . . | Rev | CGGGGATCC-ATCCTGCTCTTTTTTGCCGG | BamHI
fu Orf1-(Orf46.1)-His | Fwd | CGCGGATCCGCTAGC-GGACACACTTATTTCGGCATC | NheI
fu Orf1-(Orf46.1)-His | Rev | CGCGGATCC-CCAGCGGTAGCCTAATTTGAT |
fu (Orf1)-Orf46.1-His | Fwd | CGCGGATCCGGTGGTGGTGGT-TCAGATTTGGCAAACGATTC | BamHI
fu (Orf1)-Orf46.1-His | Rev | CCCAAGCTT-CGTATCATATTTCACGTGC | HindIII
fu (919)-Orf46.1-His | Fwd1 | GCGGCGTCGACGGTGGCGGAGGCACTGGATCCTCAG | SalI
fu (919)-Orf46.1-His | Fwd2 | GGAGGCACTGGATCCTCAGATTTGGCAAACGATTC |
fu (919)-Orf46.1-His | Rev | CCCGCTCGAG-CGTATCATATTTCACGTGC | XhoI
Fu (orf46)-287-His | Fwd | CGGGGATCCGGGGGCGGCGGTGGCG | BamHI
Fu (orf46)-287-His | Rev | CCCAAGCTTATCCTGCTCTTTTTTGCCGGC | HindIII
Fu (orf46)-919-His | Fwd | CGCGGATCCGGTGGTGGTGGTCAAAGCAAGAGCATCCAAACC | BamHI
Fu (orf46)-919-His | Rev | CCCAAGCTTCGGGCGGTATTCGGGCTTC | HindIII
Fu (orf46-919)-287-His | Fwd | CCCCAAGCTTGGGGGCGGCGGTGGCG | HindIII
Fu (orf46-919)-287-His | Rev | CCCGCTCGAGATCCTGCTCTTTTTTGCCGGC | XhoI
Fu (orf46-287)-919-His | Fwd | CCCAAGCTTGGTGGTGGTGGTGGTCAAAGCAAGAGCATCCAAACC | HindIII
Fu (orf46-287)-919-His | Rev | CCCGCTCGAGCGGGCGGTATTCGGGCTT | XhoI
(ΔG741)-961c-His | Fwd1 | GGAGGCACTGGATCCGCAGCCACAAACGACGACGA | XhoI
(ΔG741)-961c-His | Fwd2 | GCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG |
(ΔG741)-961c-His | Rev | CCCGCTCGAG-ACCCAGCTTGTAAGGTTG | XhoI
(ΔG741)-961-His | Fwd1 | GGAGGCACTGGATCCGCAGCCACAAACGACGACGA | XhoI
(ΔG741)-961-His | Fwd2 | GCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG |
(ΔG741)-961-His | Rev | CCCGCTCGAG-CCACTCGTAATTGACGCC | XhoI
(ΔG741)-983-His | Fwd | GCGGCCTCGAG-GGATCCGGCGGAGGCGGCACTTCTGCG | XhoI
(ΔG741)-983-His | Rev | CCCGCTCGAG-GAACCGGTAGCCTACG | XhoI
(ΔG741)-orf46.1-His | Fwd1 | GGAGGCACTGGATCCTCAGATTTGGCAAACGATTC | SalI
(ΔG741)-orf46.1-His | Fwd2 | GCGGCGTCGACGGTGGCGGAGGCACTGGATCCTCAGA |
(ΔG741)-orf46.1-His | Rev | CCCGCTCGAG-CGTATCATATTTCACGTGC | XhoI
(ΔG983)-741(MC58)-His | Fwd | GCGGCCTCGAG-GGATCCGGAGGGGGTGGTGTCGCC | XhoI
(ΔG983)-741(MC58)-His | Rev | CCCGCTCGAG-TTGCTTGGCGGCAAG | XhoI
(ΔG983)-961c-His | Fwd1 | GGAGGCACTGGATCCGCAGCCACAAACGACGACGA | XhoI
(ΔG983)-961c-His | Fwd2 | GCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG |
(ΔG983)-961c-His | Rev | CCCGCTCGAG-ACCCAGCTTGTAAGGTTG | XhoI
(ΔG983)-961-His | Fwd1 | GGAGGCACTGGATCCGCAGCCACAAACGACGACGA | XhoI
(ΔG983)-961-His | Fwd2 | GCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG |
(ΔG983)-961-His | Rev | CCCGCTCGAG-CCACTCGTAATTGACGCC | XhoI
(ΔG983)-Orf46.1-His | Fwd1 | GGAGGCACTGGATCCTCAGATTTGGCAAACGATTC | SalI
(ΔG983)-Orf46.1-His | Fwd2 | GCGGCGTCGACGGTGGCGGAGGCACTGGATCCTCAGA |
(ΔG983)-Orf46.1-His | Rev | CCCGCTCGAG-CGTATCATATTTCACGTGC | XhoI

[0088] In all constructs starting with an ATG not followed by a unique NheI site, the ATG codon is part of the NdeI site used for cloning. The constructs made using NheI as a cloning site at the 5′ end (e.g. all those containing 287 at the N-terminus) have two additional codons (GCT AGC) fused to the coding sequence of the antigen.

[0089] Preparation of Chromosomal DNA Templates

[0090] N. meningitidis strains 2996, MC58, 394.98, 1000 and BZ232 (and others) were grown to exponential phase in 100 ml of GC medium, harvested by centrifugation, and resuspended in 5 ml buffer (20% w/v sucrose, 50 mM Tris-HCl, 50 mM EDTA, pH 8). After 10 minutes incubation on ice, the bacteria were lysed by adding 10 ml of lysis solution (50 mM NaCl, 1% Na-Sarkosyl, 50 μg/ml Proteinase K), and the suspension was incubated at 37° C. for 2 hours. Two phenol extractions (equilibrated to pH 8) and one CHCl₃/isoamylalcohol (24:1) extraction were performed. DNA was precipitated by addition of 0.3M sodium acetate and 2 volumes of ethanol, and collected by centrifugation. The pellet was washed once with 70% (v/v) ethanol and redissolved in 4.0 ml TE buffer (10 mM Tris-HCl, 1 mM EDTA, pH 8.0). The DNA concentration was measured by reading OD₂₆₀.
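As an illustration of the OD₂₆₀ reading, the sketch below converts an absorbance value into a DNA concentration and a total yield. The text does not state the conversion that was used; the factor of 50 μg/ml per OD₂₆₀ unit is the conventional value for double-stranded DNA at a 1 cm path length and is an assumption here, as are the function names and the example reading and dilution.

```python
def dsdna_concentration_ug_per_ml(
    od260: float,
    dilution_factor: float = 1.0,
    ug_per_ml_per_od: float = 50.0,  # conventional dsDNA factor (assumed)
) -> float:
    """Estimate the dsDNA concentration of the undiluted stock from OD260."""
    return od260 * dilution_factor * ug_per_ml_per_od


def total_yield_ug(concentration_ug_per_ml: float, volume_ml: float) -> float:
    """Total DNA recovered in the given volume."""
    return concentration_ug_per_ml * volume_ml


if __name__ == "__main__":
    # Hypothetical reading: OD260 = 0.25 on a 1:20 dilution of the stock
    conc = dsdna_concentration_ug_per_ml(0.25, dilution_factor=20)
    # 4.0 ml is the TE volume used to redissolve the pellet
    print(conc, total_yield_ug(conc, volume_ml=4.0))
```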

[0091] PCR Amplification

[0092] The standard PCR protocol was as follows: 200 ng of genomic DNA from 2996, MC58, 1000, or BZ232 strains or 10 ng of plasmid DNA preparation of recombinant clones were used as template in the presence of 40 μM of each oligonucleotide primer, 400-800 μM dNTPs solution, 1× PCR buffer (including 1.5 mM MgCl₂), 2.5 units Taq DNA polymerase (using Perkin-Elmer AmpliTaq, Boehringer Mannheim Expand™ Long Template).

[0093] After a preliminary 3 minute incubation of the whole mix at 95° C., each sample underwent a two-step amplification: the first 5 cycles were performed using the hybridisation temperature that excluded the restriction enzyme tail of the primer (T_(m1)). This was followed by 30 cycles according to the hybridisation temperature calculated for the whole length oligos (T_(m2)). Elongation times, performed at 68° C. or 72° C., varied according to the length of the Orf to be amplified. In the case of Orf1 the elongation time, starting from 3 minutes, was increased by 15 seconds each cycle. The cycles were completed with a 10 minute extension step at 72° C.
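
The two-step cycling scheme described above can be sketched as a simple schedule. The following Python example is illustrative only: the hybridisation temperatures passed in are hypothetical, the class and function names are not part of the protocol, and it assumes that the 15 second increment used for Orf1 applies across all 35 cycles, which the text does not state explicitly.

```python
from dataclasses import dataclass

@dataclass
class Cycle:
    number: int              # 1-based cycle number
    annealing_temp_c: float  # hybridisation temperature used in this cycle
    elongation_s: int        # elongation time in seconds

def build_cycles(tm1: float, tm2: float, start_elongation_s: int,
                 increment_s: int = 0,
                 first_phase: int = 5, second_phase: int = 30) -> list[Cycle]:
    """Build the schedule: `first_phase` cycles at Tm1, then `second_phase` at Tm2.

    Assumption: the elongation increment (if any) is added every cycle,
    counted from the first cycle of the first phase.
    """
    cycles = []
    for n in range(1, first_phase + second_phase + 1):
        anneal = tm1 if n <= first_phase else tm2
        elongation = start_elongation_s + increment_s * (n - 1)
        cycles.append(Cycle(n, anneal, elongation))
    return cycles

# Orf1-like case: elongation starts at 3 minutes and grows by 15 s per cycle.
# The Tm values (56 and 68 degrees C) are hypothetical.
orf1 = build_cycles(tm1=56.0, tm2=68.0, start_elongation_s=180, increment_s=15)
print(len(orf1), orf1[0], orf1[-1])
```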

[0094] The amplified DNA was loaded directly on a 1% agarose gel. The DNA fragment corresponding to the band of correct size was purified from the gel using the Qiagen Gel Extraction Kit, following the manufacturer's protocol.

[0095] Digestion of PCR Fragments and of the Cloning Vectors

[0096] The purified DNA corresponding to the amplified fragment was digested with the appropriate restriction enzymes for cloning into pET-21b+, pET-22b+ or pET-24b+. Digested fragments were purified using the QIAquick PCR purification kit (following the manufacturer's instructions) and eluted with either H₂O or 10 mM Tris, pH 8.5. Plasmid vectors were digested with the appropriate restriction enzymes, loaded onto a 1.0% agarose gel and the band corresponding to the digested vector purified using the Qiagen QIAquick Gel Extraction Kit.

[0097] Cloning

[0098] The fragments corresponding to each gene, previously digested and purified, were ligated into pET-21b+, pET-22b+ or pET-24b+. A molar ratio of 3:1 fragment/vector was used with T4 DNA ligase in the ligation buffer supplied by the manufacturer.
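
To illustrate the 3:1 fragment/vector molar ratio, the mass of fragment needed for a given mass of vector can be estimated from the two lengths, since for double-stranded DNA the molar amount is proportional to mass divided by length. The sketch below is a minimal example; the vector mass and the fragment and vector sizes are hypothetical, and the function name is an assumption.

```python
def insert_mass_ng(vector_ng: float, vector_bp: int, insert_bp: int,
                   molar_ratio: float = 3.0) -> float:
    """Mass of insert (ng) giving `molar_ratio` moles of insert per mole of vector.

    Uses the proportionality moles ~ mass / length for double-stranded DNA.
    """
    return vector_ng * (insert_bp / vector_bp) * molar_ratio

# Hypothetical example: 50 ng of a 5000 bp vector and a 1000 bp fragment.
print(insert_mass_ng(vector_ng=50.0, vector_bp=5000, insert_bp=1000))
```

With these hypothetical values the estimate is about 30 ng of fragment.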

[0099] Recombinant plasmid was transformed into competent E. coli DH5 or HB101 by incubating the ligase reaction solution and bacteria for 40 minutes on ice, then at 37° C. for 3 minutes. This was followed by the addition of 800 μl LB broth and incubation at 37° C. for 20 minutes. The cells were centrifuged at maximum speed in an Eppendorf microfuge, resuspended in approximately 200 μl of the supernatant and plated onto LB ampicillin (100 μg/ml) agar.

[0100] Screening for recombinant clones was performed by growing randomly selected colonies overnight at 37° C. in 4.0 ml of LB broth + 100 μg/ml ampicillin. Cells were pelleted and plasmid DNA extracted using the Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's instructions. Approximately 1 μg of each individual miniprep was digested with the appropriate restriction enzymes and the digest loaded onto a 1-1.5% agarose gel (depending on the expected insert size), in parallel with the molecular weight marker (1 kb DNA Ladder, GIBCO). Positive clones were selected on the basis of the size of insert.

[0101] Expression

[0102] After cloning each gene into the expression vector, recombinant plasmids were transformed into E. coli strains suitable for expression of the recombinant protein. 1 μl of each construct was used to transform E. coli BL21-DE3 as described above. Single recombinant colonies were inoculated into 2 ml LB+Amp (100 μg/ml), incubated at 37° C. overnight, then diluted 1:30 in 20 ml of LB+Amp (100 μg/ml) in 100 ml flasks, to give an OD₆₀₀ between 0.1 and 0.2. The flasks were incubated at 30° C. or at 37° C. in a gyratory water bath shaker until OD₆₀₀ indicated exponential growth suitable for induction of expression (0.4-0.8 OD). Protein expression was induced by addition of 1.0 mM IPTG. After 3 hours incubation at 30° C. or 37° C. the OD₆₀₀ was measured and expression examined. 1.0 ml of each sample was centrifuged in a microfuge, the pellet resuspended in PBS and analysed by SDS-PAGE and Coomassie Blue staining.

[0103] Purification of His-tagged Proteins

[0104] Various forms of 287 were cloned from strains 2996 and MC58. They were constructed with a C-terminal His-tag fusion and included a mature form (aa 18-427), constructs with deletions (Δ1, Δ2, Δ3 and Δ4) and clones composed of either B or C domains. For each clone purified as a His-fusion, a single colony was streaked and grown overnight at 37° C. on an LB/Amp (100 μg/ml) agar plate. An isolated colony from this plate was inoculated into 20 ml of LB/Amp (100 μg/ml) liquid medium and grown overnight at 37° C. with shaking. The overnight culture was diluted 1:30 into 1.0 L LB/Amp (100 μg/ml) liquid medium and allowed to grow at the optimal temperature (30 or 37° C.) until the OD₅₅₀ reached 0.6-0.8. Expression of recombinant protein was induced by addition of IPTG (final concentration 1.0 mM) and the culture incubated for a further 3 hours. Bacteria were harvested by centrifugation at 8000 g for 15 min at 4° C. The bacterial pellet was resuspended in 7.5 ml of either (i) cold buffer A (300 mM NaCl, 50 mM phosphate buffer, 10 mM imidazole, pH 8.0) for soluble proteins or (ii) buffer B (10 mM Tris-HCl, 100 mM phosphate buffer, pH 8.8 and, optionally, 8M urea) for insoluble proteins. Proteins purified in a soluble form included 287-His, Δ1, Δ2, Δ3 and Δ4287-His, Δ4287MC58-His, 287c-His and 287cMC58-His. Protein 287bMC58-His was insoluble and purified accordingly. Cells were disrupted by sonication on ice four times for 30 sec at 40 W using a Branson sonifier 450 and centrifuged at 13000 g for 30 min at 4° C. For insoluble proteins, pellets were resuspended in 2.0 ml buffer C (6 M guanidine hydrochloride, 100 mM phosphate buffer, 10 mM Tris-HCl, pH 7.5) and treated with 10 passes of a Dounce homogenizer. The homogenate was centrifuged at 13000 g for 30 min and the supernatant retained.
Supernatants for both soluble and insoluble preparations were mixed with 150 μl Ni²⁺-resin (previously equilibrated with either buffer A or buffer B, as appropriate) and incubated at room temperature with gentle agitation for 30 min. The resin was Chelating Sepharose Fast Flow (Pharmacia), prepared according to the manufacturer's protocol. The batch-wise preparation was centrifuged at 700 g for 5 min at 4° C. and the supernatant discarded. The resin was washed twice (batch-wise) with 10 ml buffer A or B for 10 min, resuspended in 1.0 ml buffer A or B and loaded onto a disposable column. The resin continued to be washed with either (i) buffer A at 4° C. or (ii) buffer B at room temperature, until the OD₂₈₀ of the flow-through reached 0.02-0.01. The resin was further washed with either (i) cold buffer C (300 mM NaCl, 50 mM phosphate buffer, 20 mM imidazole, pH 8.0) or (ii) buffer D (10 mM Tris-HCl, 100 mM phosphate buffer, pH 6.3 and, optionally, 8M urea) until OD₂₈₀ of the flow-through reached 0.02-0.01. The His-fusion protein was eluted by addition of 700 μl of either (i) cold elution buffer A (300 mM NaCl, 50 mM phosphate buffer, 250 mM imidazole, pH 8.0) or (ii) elution buffer B (10 mM Tris-HCl, 100 mM phosphate buffer, pH 4.5 and, optionally, 8M urea) and fractions collected until the OD₂₈₀ indicated all the recombinant protein was obtained. 20 μl aliquots of each elution fraction were analysed by SDS-PAGE. Protein concentrations were estimated using the Bradford assay.

[0105] Renaturation of Denatured His-fusion Proteins

[0106] Denaturation was required to solubilize 287bMC58, so a renaturation step was employed prior to immunisation. Glycerol was added to the denatured fractions obtained above to give a final concentration of 10% v/v. The proteins were diluted to 200 μg/ml using dialysis buffer I (10% v/v glycerol, 0.5M arginine, 50 mM phosphate buffer, 5.0 mM reduced glutathione, 0.5 mM oxidised glutathione, 2.0M urea, pH 8.8) and dialysed against the same buffer for 12-14 hours at 4° C. Further dialysis was performed with buffer II (10% v/v glycerol, 0.5M arginine, 50 mM phosphate buffer, 5.0 mM reduced glutathione, 0.5 mM oxidised glutathione, pH 8.8) for 12-14 hours at 4° C. Protein concentration was estimated using the formula:

Protein (mg/ml) = (1.55 × OD₂₈₀) − (0.76 × OD₂₆₀)
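
The formula above can be applied directly, as in the following minimal Python sketch. The coefficients are those given in the formula; the function name and the absorbance readings in the example are hypothetical.

```python
def protein_mg_per_ml(od280: float, od260: float) -> float:
    """Estimate protein concentration (mg/ml) from OD280 and OD260 readings,
    using the formula given above: (1.55 x OD280) - (0.76 x OD260)."""
    return 1.55 * od280 - 0.76 * od260

# Hypothetical readings for a renatured fraction.
print(round(protein_mg_per_ml(od280=0.50, od260=0.30), 3))
```

The OD₂₆₀ term corrects for the contribution of nucleic acid to the absorbance at 280 nm, so a sample with a relatively high OD₂₆₀ gives a lower protein estimate for the same OD₂₈₀.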

[0107] Immunization

[0108] Balb/C mice were immunized with antigens on days 0, 21 and 35 and sera analyzed at day 49.

[0109] Sera Analysis—ELISA

[0110] The acapsulated MenB M7 and the capsulated strains were plated on chocolate agar plates and incubated overnight at 37° C. with 5% CO₂. Bacterial colonies were collected from the agar plates using a sterile Dacron swab and inoculated into Mueller-Hinton Broth (Difco) containing 0.25% glucose. Bacterial growth was monitored every 30 minutes by following OD₆₂₀. The bacteria were allowed to grow until the OD reached 0.4-0.5. The culture was centrifuged for 10 minutes at 4000 rpm. The supernatant was discarded and bacteria were washed twice with PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated for 1 hour at 37° C. and then overnight at 4° C. with stirring. 100 μl bacterial cells were added to each well of a 96 well Greiner plate and incubated overnight at 4° C. The wells were then washed three times with PBT washing buffer (0.1% Tween-20 in PBS). 200 μl of saturation buffer (2.7% polyvinylpyrrolidone 10 in water) was added to each well and the plates incubated for 2 hours at 37° C. Wells were washed three times with PBT. 200 μl of diluted sera (Dilution buffer: 1% BSA, 0.1% Tween-20, 0.1% NaN₃ in PBS) were added to each well and the plates incubated for 2 hours at 37° C. Wells were washed three times with PBT. 100 μl of HRP-conjugated rabbit anti-mouse (Dako) serum diluted 1:2000 in dilution buffer were added to each well and the plates were incubated for 90 minutes at 37° C. Wells were washed three times with PBT buffer. 100 μl of substrate buffer for HRP (25 ml of citrate buffer pH 5, 10 mg of o-phenylenediamine and 10 μl of H₂O₂) were added to each well and the plates were left at room temperature for 20 minutes. 100 μl 12.5% H₂SO₄ was added to each well and OD₄₉₀ was measured. The ELISA titers were calculated arbitrarily as the dilution of sera which gave an OD₄₉₀ value of 0.4 above the level of preimmune sera. The ELISA was considered positive when the dilution of sera with OD₄₉₀ of 0.4 was higher than 1:400.
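
The titer definition above (the serum dilution giving an OD₄₉₀ of 0.4 above the preimmune level) can be illustrated with a short Python sketch. The text does not say how the titer was read off between tested dilutions; the log-linear interpolation used here is an assumption, as are the function names and the example readings.

```python
import math
from typing import Optional, Sequence

def elisa_titer(dilutions: Sequence[float], od_values: Sequence[float],
                preimmune_od: float, delta: float = 0.4) -> Optional[float]:
    """Reciprocal dilution at which OD490 falls to preimmune_od + delta.

    `dilutions` are reciprocal dilutions in ascending order (e.g. 100, 200, ...),
    `od_values` the matching OD490 readings. Interpolation between neighbouring
    dilutions is done on a log10 dilution scale (an assumption of this sketch).
    Returns None if the cutoff is never crossed.
    """
    cutoff = preimmune_od + delta
    pairs = list(zip(dilutions, od_values))
    for (d_lo, od_lo), (d_hi, od_hi) in zip(pairs, pairs[1:]):
        if od_lo >= cutoff >= od_hi and od_lo != od_hi:
            frac = (od_lo - cutoff) / (od_lo - od_hi)
            log_d = math.log10(d_lo) + frac * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_d
    return None

def is_positive(titer: Optional[float], threshold: float = 400.0) -> bool:
    """Positive when the titer exceeds 1:400, as stated above."""
    return titer is not None and titer > threshold

# Hypothetical two-fold dilution series and readings.
titer = elisa_titer([100, 200, 400, 800, 1600], [1.8, 1.4, 1.0, 0.6, 0.2],
                    preimmune_od=0.1)
print(titer, is_positive(titer))
```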

[0111] Sera Analysis—FACS Scan Bacteria Binding Assay

[0112] The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated overnight at 37° C. with 5% CO₂. Bacterial colonies were collected from the agar plates using a sterile Dacron swab and inoculated into 4 tubes containing 8 ml each Mueller-Hinton Broth (Difco) containing 0.25% glucose. Bacterial growth was monitored every 30 minutes by following OD₆₂₀. The bacteria were allowed to grow until the OD reached 0.35-0.5. The culture was centrifuged for 10 minutes at 4000 rpm. The supernatant was discarded and the pellet was resuspended in blocking buffer (1% BSA in PBS, 0.4% NaN₃) and centrifuged for 5 minutes at 4000 rpm. Cells were resuspended in blocking buffer to reach OD₆₂₀ of 0.05. 100 μl bacterial cells were added to each well of a Costar 96 well plate. 100 μl of diluted (1:100, 1:200, 1:400) sera (in blocking buffer) were added to each well and plates incubated for 2 hours at 4° C. Cells were centrifuged for 5 minutes at 4000 rpm, the supernatant aspirated and cells washed by addition of 200 μl/well of blocking buffer. 100 μl of R-Phycoerythrin conjugated F(ab)₂ goat anti-mouse, diluted 1:100, was added to each well and plates incubated for 1 hour at 4° C. Cells were spun down by centrifugation at 4000 rpm for 5 minutes and washed by addition of 200 μl/well of blocking buffer. The supernatant was aspirated and cells resuspended in 200 μl/well of PBS, 0.25% formaldehyde. Samples were transferred to FACScan tubes and read. The FACScan settings (laser power 15 mW) were: FL2 on; FSC-H threshold: 92; FSC PMT Voltage: E 01; SSC PMT: 474; Amp. Gains 6.1; FL-2 PMT: 586; compensation values: 0.

[0113] Sera Analysis—Bactericidal Assay

[0114] N. meningitidis strain 2996 was grown overnight at 37° C. on chocolate agar plates (starting from a frozen stock) with 5% CO₂. Colonies were collected and used to inoculate 7 ml Mueller-Hinton broth, containing 0.25% glucose to reach an OD₆₂₀ of 0.05-0.08. The culture was incubated for approximately 1.5 hours at 37° C. with shaking until the OD₆₂₀ reached 0.23-0.24. Bacteria were diluted in 50 mM Phosphate buffer pH 7.2 containing 10 mM MgCl₂, 10 mM CaCl₂ and 0.5% (w/v) BSA (assay buffer) at the working dilution of 10⁵ CFU/ml. The total volume of the final reaction mixture was 50 μl with 25 μl of serial two fold dilution of test serum, 12.5 μl of bacteria at the working dilution, 12.5 μl of baby rabbit complement (final concentration 25%).

[0115] Controls included bacteria incubated with complement serum, immune sera incubated with bacteria and with complement inactivated by heating at 56° C. for 30 minutes. Immediately after the addition of the baby rabbit complement, 10 μl of the controls were plated on Mueller-Hinton agar plates using the tilt method (time 0). The 96-well plate was incubated for 1 hour at 37° C. with rotation. 7 μl of each sample were plated on Mueller-Hinton agar plates as spots, whereas 10 μl of the controls were plated on Mueller-Hinton agar plates using the tilt method (time 1). Agar plates were incubated for 18 hours at 37° C. and the colonies corresponding to time 0 and time 1 were counted.
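
The volumes given for the bactericidal reaction mixture can be checked with simple arithmetic, as in the Python sketch below. It confirms the stated 25% final complement concentration and estimates the number of colony-forming units expected per well and per plated aliquot in the absence of any killing or growth. The function names are illustrative, and the sketch assumes that control wells have the same 50 μl total volume and bacterial input as test wells, which the text does not state.

```python
SERUM_UL = 25.0            # serial two-fold dilution of test serum
BACTERIA_UL = 12.5         # bacteria at the working dilution
COMPLEMENT_UL = 12.5       # baby rabbit complement
WORKING_CFU_PER_ML = 1e5   # working dilution of bacteria

def reaction_summary() -> tuple[float, float, float]:
    """Return (total volume in ul, complement fraction, CFU per well)."""
    total_ul = SERUM_UL + BACTERIA_UL + COMPLEMENT_UL
    complement_fraction = COMPLEMENT_UL / total_ul
    cfu_per_well = WORKING_CFU_PER_ML * BACTERIA_UL / 1000.0
    return total_ul, complement_fraction, cfu_per_well

def expected_cfu(plated_ul: float, total_ul: float, cfu_per_well: float) -> float:
    """CFU expected in a plated aliquot if there were no killing and no growth."""
    return cfu_per_well * plated_ul / total_ul

total_ul, complement_fraction, cfu_per_well = reaction_summary()
print(total_ul, complement_fraction, cfu_per_well)
print(expected_cfu(7.0, total_ul, cfu_per_well))    # 7 ul sample spot
print(expected_cfu(10.0, total_ul, cfu_per_well))   # 10 ul control aliquot
```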

[0116] Sera Analysis—Western Blots

[0117] Purified proteins (500 ng/lane), outer membrane vesicles (5 μg) and total cell extracts (25 μg) derived from MenB strain 2996 were loaded onto a 12% SDS-polyacrylamide gel and transferred to a nitrocellulose membrane. The transfer was performed for 2 hours at 150 mA at 4° C., using transfer buffer (0.3% Tris base, 1.44% glycine, 20% (v/v) methanol). The membrane was saturated by overnight incubation at 4° C. in saturation buffer (10% skimmed milk, 0.1% Triton X100 in PBS). The membrane was washed twice with washing buffer (3% skimmed milk, 0.1% Triton X100 in PBS) and incubated for 2 hours at 37° C. with mice sera diluted 1:200 in washing buffer. The membrane was washed twice and incubated for 90 minutes with a 1:2000 dilution of horseradish peroxidase labelled anti-mouse Ig. The membrane was washed twice with 0.1% Triton X100 in PBS and developed with the Opti-4CN Substrate Kit (Bio-Rad). The reaction was stopped by adding water.

[0118] The OMVs were prepared as follows: N. meningitidis strain 2996 was grown overnight at 37° C. with 5% CO₂ on 5 GC plates, harvested with a loop and resuspended in 10 ml of 20 mM Tris-HCl pH 7.5, 2 mM EDTA. Heat inactivation was performed at 56° C. for 45 minutes and the bacteria disrupted by sonication for 5 minutes on ice (50% duty cycle, 50% output, Branson sonifier 3 mm microtip). Unbroken cells were removed by centrifugation at 5000 g for 10 minutes, and the supernatant containing the total cell envelope fraction was recovered and further centrifuged overnight at 50000 g at 4° C. The pellet containing the membranes was resuspended in 2% sarkosyl, 20 mM Tris-HCl pH 7.5, 2 mM EDTA and incubated at room temperature for 20 minutes to solubilise the inner membranes. The suspension was centrifuged at 10000 g for 10 minutes to remove aggregates, and the supernatant was further centrifuged at 50000 g for 3 hours. The pellet, containing the outer membranes, was washed in PBS and resuspended in the same buffer. Protein concentration was measured by the D.C. Bio-Rad Protein assay (Modified Lowry method), using BSA as a standard.

[0119] Total cell extracts were prepared as follows: N. meningitidis strain 2996 was grown overnight on a GC plate, harvested with a loop and resuspended in 1 ml of 20 mM Tris-HCl. Heat inactivation was performed at 56° C. for 30 minutes.

1 121 1 608 PRT Neisseria meningitidis 1 Leu Gly Ile Ser Arg Lys Ile Ser Leu Ile Leu Ser Ile Leu Ala Val 1 5 10 15 Cys Leu Pro Met His Ala His Ala Ser Asp Leu Ala Asn Asp Ser Phe 20 25 30 Ile Arg Gln Val Leu Asp Arg Gln His Phe Glu Pro Asp Gly Lys Tyr 35 40 45 His Leu Phe Gly Ser Arg Gly Glu Leu Ala Glu Arg Ser Gly His Ile 50 55 60 Gly Leu Gly Lys Ile Gln Ser His Gln Leu Gly Asn Leu Met Ile Gln 65 70 75 80 Gln Ala Ala Ile Lys Gly Asn Ile Gly Tyr Ile Val Arg Phe Ser Asp 85 90 95 His Gly His Glu Val His Ser Pro Phe Asp Asn His Ala Ser His Ser 100 105 110 Asp Ser Asp Glu Ala Gly Ser Pro Val Asp Gly Phe Ser Leu Tyr Arg 115 120 125 Ile His Trp Asp Gly Tyr Glu His His Pro Ala Asp Gly Tyr Asp Gly 130 135 140 Pro Gln Gly Gly Gly Tyr Pro Ala Pro Lys Gly Ala Arg Asp Ile Tyr 145 150 155 160 Ser Tyr Asp Ile Lys Gly Val Ala Gln Asn Ile Arg Leu Asn Leu Thr 165 170 175 Asp Asn Arg Ser Thr Gly Gln Arg Leu Ala Asp Arg Phe His Asn Ala 180 185 190 Gly Ser Met Leu Thr Gln Gly Val Gly Asp Gly Phe Lys Arg Ala Thr 195 200 205 Arg Tyr Ser Pro Glu Leu Asp Arg Ser Gly Asn Ala Ala Glu Ala Phe 210 215 220 Asn Gly Thr Ala Asp Ile Val Lys Asn Ile Ile Gly Ala Ala Gly Glu 225 230 235 240 Ile Val Gly Ala Gly Asp Ala Val Gln Gly Ile Ser Glu Gly Ser Asn 245 250 255 Ile Ala Val Met His Gly Leu Gly Leu Leu Ser Thr Glu Asn Lys Met 260 265 270 Ala Arg Ile Asn Asp Leu Ala Asp Met Ala Gln Leu Lys Asp Tyr Ala 275 280 285 Ala Ala Ala Ile Arg Asp Trp Ala Val Gln Asn Pro Asn Ala Ala Gln 290 295 300 Gly Ile Glu Ala Val Ser Asn Ile Phe Met Ala Ala Ile Pro Ile Lys 305 310 315 320 Gly Ile Gly Ala Val Arg Gly Lys Tyr Gly Leu Gly Gly Ile Thr Ala 325 330 335 His Pro Ile Lys Arg Ser Gln Met Gly Ala Ile Ala Leu Pro Lys Gly 340 345 350 Lys Ser Ala Val Ser Asp Asn Phe Ala Asp Ala Ala Tyr Ala Lys Tyr 355 360 365 Pro Ser Pro Tyr His Ser Arg Asn Ile Arg Ser Asn Leu Glu Gln Arg 370 375 380 Tyr Gly Lys Glu Asn Ile Thr Ser Ser Thr Val Pro Pro Ser Asn Gly 385 390 395 400 Lys Asn Val Lys Leu Ala Asp Gln Arg His Pro Lys Thr Gly 
Val Pro 405 410 415 Phe Asp Gly Lys Gly Phe Pro Asn Phe Glu Lys His Val Lys Tyr Asp 420 425 430 Thr Lys Leu Asp Ile Gln Glu Leu Ser Gly Gly Gly Ile Pro Lys Ala 435 440 445 Lys Pro Val Ser Asp Ala Lys Pro Arg Trp Glu Val Asp Arg Lys Leu 450 455 460 Asn Lys Leu Thr Thr Arg Glu Gln Val Glu Lys Asn Val Gln Glu Ile 465 470 475 480 Arg Asn Gly Asn Lys Asn Ser Asn Phe Ser Gln His Ala Gln Leu Glu 485 490 495 Arg Glu Ile Asn Lys Leu Lys Ser Ala Asp Glu Ile Asn Phe Ala Asp 500 505 510 Gly Met Gly Lys Phe Thr Asp Ser Met Asn Asp Lys Ala Phe Ser Arg 515 520 525 Leu Val Lys Ser Val Lys Glu Asn Gly Phe Thr Asn Pro Val Val Glu 530 535 540 Tyr Val Glu Ile Asn Gly Lys Ala Tyr Ile Val Arg Gly Asn Asn Arg 545 550 555 560 Val Phe Ala Ala Glu Tyr Leu Gly Arg Ile His Glu Leu Lys Phe Lys 565 570 575 Lys Val Asp Phe Pro Val Pro Asn Thr Ser Trp Lys Asn Pro Thr Asp 580 585 590 Val Leu Asn Glu Ser Gly Asn Val Lys Arg Pro Arg Tyr Arg Ser Lys 595 600 605 2 464 PRT Artificial Sequence deltaG287 2 Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala Ala Pro 1 5 10 15 Val Val Ser Glu Lys Glu Thr Glu Ala Lys Glu Asp Ala Pro Gln Ala 20 25 30 Gly Ser Gln Gly Gln Gly Ala Pro Ser Ala Gln Gly Ser Gln Asp Met 35 40 45 Ala Ala Val Ser Glu Glu Asn Thr Gly Asn Gly Gly Ala Val Thr Ala 50 55 60 Asp Asn Pro Lys Asn Glu Asp Glu Val Ala Gln Asn Asp Met Pro Gln 65 70 75 80 Asn Ala Ala Gly Thr Asp Ser Ser Thr Pro Asn His Thr Pro Asp Pro 85 90 95 Asn Met Leu Ala Gly Asn Met Glu Asn Gln Ala Thr Asp Ala Gly Glu 100 105 110 Ser Ser Gln Pro Ala Asn Gln Pro Asp Met Ala Asn Ala Ala Asp Gly 115 120 125 Met Gln Gly Asp Asp Pro Ser Ala Gly Gly Gln Asn Ala Gly Asn Thr 130 135 140 Ala Ala Gln Gly Ala Asn Gln Ala Gly Asn Asn Gln Ala Ala Gly Ser 145 150 155 160 Ser Asp Pro Ile Pro Ala Ser Asn Pro Ala Pro Ala Asn Gly Gly Ser 165 170 175 Asn Phe Gly Arg Val Asp Leu Ala Asn Gly Val Leu Ile Asp Gly Pro 180 185 190 Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser Cys Ser Gly 195 200 205 Asn Asn Phe Leu Asp Glu Glu Val Gln Leu 
Lys Ser Glu Phe Glu Lys 210 215 220 Leu Ser Asp Ala Asp Lys Ile Ser Asn Tyr Lys Lys Asp Gly Lys Asn 225 230 235 240 Asp Lys Phe Val Gly Leu Val Ala Asp Ser Val Gln Met Lys Gly Ile 245 250 255 Asn Gln Tyr Ile Ile Phe Tyr Lys Pro Lys Pro Thr Ser Phe Ala Arg 260 265 270 Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser Leu Pro Ala Glu Met Pro 275 280 285 Leu Ile Pro Val Asn Gln Ala Asp Thr Leu Ile Val Asp Gly Glu Ala 290 295 300 Val Ser Leu Thr Gly His Ser Gly Asn Ile Phe Ala Pro Glu Gly Asn 305 310 315 320 Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys Leu Pro Gly Gly Ser Tyr 325 330 335 Ala Leu Arg Val Gln Gly Glu Pro Ala Lys Gly Glu Met Leu Ala Gly 340 345 350 Ala Ala Val Tyr Asn Gly Glu Val Leu His Phe His Thr Glu Asn Gly 355 360 365 Arg Pro Tyr Pro Thr Arg Gly Arg Phe Ala Ala Lys Val Asp Phe Gly 370 375 380 Ser Lys Ser Val Asp Gly Ile Ile Asp Ser Gly Asp Asp Leu His Met 385 390 395 400 Gly Thr Gln Lys Phe Lys Ala Ala Ile Asp Gly Asn Gly Phe Lys Gly 405 410 415 Thr Trp Thr Glu Asn Gly Ser Gly Asp Val Ser Gly Lys Phe Tyr Gly 420 425 430 Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr Ser Tyr Arg Pro Thr Asp 435 440 445 Ala Glu Lys Gly Gly Phe Gly Val Phe Ala Gly Lys Lys Glu Gln Asp 450 455 460 3 2505 DNA Artificial Sequence deltaG287-919 3 atggctagcc ccgatgttaa atcggcggac acgctgtcaa aaccggccgc tcctgttgtt 60 gctgaaaaag agacagaggt aaaagaagat gcgccacagg caggttctca aggacagggc 120 gcgccatcca cacaaggcag ccaagatatg gcggcagttt cggcagaaaa tacaggcaat 180 ggcggtgcgg caacaacgga caaacccaaa aatgaagacg agggaccgca aaatgatatg 240 ccgcaaaatt ccgccgaatc cgcaaatcaa acagggaaca accaacccgc cgattcttca 300 gattccgccc ccgcgtcaaa ccctgcacct gcgaatggcg gtagcaattt tggaagggtt 360 gatttggcta atggcgtttt gattgatggg ccgtcgcaaa atataacgtt gacccactgt 420 aaaggcgatt cttgtaatgg tgataattta ttggatgaag aagcaccgtc aaaatcagaa 480 tttgaaaatt taaatgagtc tgaacgaatt gagaaatata agaaagatgg gaaaagcgat 540 aaatttacta atttggttgc gacagcagtt caagctaatg gaactaacaa atatgtcatc 600 atttataaag acaagtccgc ttcatcttca tctgcgcgat tcaggcgttc tgcacggtcg 
660 aggaggtcgc ttcctgccga gatgccgcta atccccgtca atcaggcgga tacgctgatt 720 gtcgatgggg aagcggtcag cctgacgggg cattccggca atatcttcgc gcccgaaggg 780 aattaccggt atctgactta cggggcggaa aaattgcccg gcggatcgta tgccctccgt 840 gtgcaaggcg aaccggcaaa aggcgaaatg cttgctggca cggccgtgta caacggcgaa 900 gtgctgcatt ttcatacgga aaacggccgt ccgtacccga ctagaggcag gtttgccgca 960 aaagtcgatt tcggcagcaa atctgtggac ggcattatcg acagcggcga tgatttgcat 1020 atgggtacgc aaaaattcaa agccgccatc gatggaaacg gctttaaggg gacttggacg 1080 gaaaatggcg gcggggatgt ttccggaagg ttttacggcc cggccggcga ggaagtggcg 1140 ggaaaataca gctatcgccc gacagatgcg gaaaagggcg gattcggcgt gtttgccggc 1200 aaaaaagagc aggatggatc cggaggagga ggatgccaaa gcaagagcat ccaaaccttt 1260 ccgcaacccg acacatccgt catcaacggc ccggaccggc cggtcggcat ccccgacccc 1320 gccggaacga cggtcggcgg cggcggggcc gtctataccg ttgtaccgca cctgtccctg 1380 ccccactggg cggcgcagga tttcgccaaa agcctgcaat ccttccgcct cggctgcgcc 1440 aatttgaaaa accgccaagg ctggcaggat gtgtgcgccc aagcctttca aacccccgtc 1500 cattcctttc aggcaaaaca gttttttgaa cgctatttca cgccgtggca ggttgcaggc 1560 aacggaagcc ttgccggtac ggttaccggc tattacgagc cggtgctgaa gggcgacgac 1620 aggcggacgg cacaagcccg cttcccgatt tacggtattc ccgacgattt tatctccgtc 1680 cccctgcctg ccggtttgcg gagcggaaaa gcccttgtcc gcatcaggca gacgggaaaa 1740 aacagcggca caatcgacaa taccggcggc acacataccg ccgacctctc ccgattcccc 1800 atcaccgcgc gcacaacggc aatcaaaggc aggtttgaag gaagccgctt cctcccctac 1860 cacacgcgca accaaatcaa cggcggcgcg cttgacggca aagccccgat actcggttac 1920 gccgaagacc ccgtcgaact tttttttatg cacatccaag gctcgggccg tctgaaaacc 1980 ccgtccggca aatacatccg catcggctat gccgacaaaa acgaacatcc ctacgtttcc 2040 atcggacgct atatggcgga caaaggctac ctcaagctcg ggcagacctc gatgcagggc 2100 atcaaagcct atatgcggca aaatccgcaa cgcctcgccg aagttttggg tcaaaacccc 2160 agctatatct ttttccgcga gcttgccgga agcagcaatg acggtcccgt cggcgcactg 2220 ggcacgccgt tgatggggga atatgccggc gcagtcgacc ggcactacat taccttgggc 2280 gcgcccttat ttgtcgccac cgcccatccg gttacccgca aagccctcaa ccgcctgatt 2340 atggcgcagg 
ataccggcag cgcgattaaa ggcgcggtgc gcgtggatta tttttgggga 2400 tacggcgacg aagccggcga acttgccggc aaacagaaaa ccacgggtta cgtctggcag 2460 ctcctaccca acggtatgaa gcccgaatac cgcccgtaac tcgag 2505 4 832 PRT Artificial Sequence deltaG287-919 4 Met Ala Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala 1 5 10 15 Ala Pro Val Val Ala Glu Lys Glu Thr Glu Val Lys Glu Asp Ala Pro 20 25 30 Gln Ala Gly Ser Gln Gly Gln Gly Ala Pro Ser Thr Gln Gly Ser Gln 35 40 45 Asp Met Ala Ala Val Ser Ala Glu Asn Thr Gly Asn Gly Gly Ala Ala 50 55 60 Thr Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Pro Gln Asn Asp Met 65 70 75 80 Pro Gln Asn Ser Ala Glu Ser Ala Asn Gln Thr Gly Asn Asn Gln Pro 85 90 95 Ala Asp Ser Ser Asp Ser Ala Pro Ala Ser Asn Pro Ala Pro Ala Asn 100 105 110 Gly Gly Ser Asn Phe Gly Arg Val Asp Leu Ala Asn Gly Val Leu Ile 115 120 125 Asp Gly Pro Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser 130 135 140 Cys Asn Gly Asp Asn Leu Leu Asp Glu Glu Ala Pro Ser Lys Ser Glu 145 150 155 160 Phe Glu Asn Leu Asn Glu Ser Glu Arg Ile Glu Lys Tyr Lys Lys Asp 165 170 175 Gly Lys Ser Asp Lys Phe Thr Asn Leu Val Ala Thr Ala Val Gln Ala 180 185 190 Asn Gly Thr Asn Lys Tyr Val Ile Ile Tyr Lys Asp Lys Ser Ala Ser 195 200 205 Ser Ser Ser Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser Leu 210 215 220 Pro Ala Glu Met Pro Leu Ile Pro Val Asn Gln Ala Asp Thr Leu Ile 225 230 235 240 Val Asp Gly Glu Ala Val Ser Leu Thr Gly His Ser Gly Asn Ile Phe 245 250 255 Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys Leu 260 265 270 Pro Gly Gly Ser Tyr Ala Leu Arg Val Gln Gly Glu Pro Ala Lys Gly 275 280 285 Glu Met Leu Ala Gly Thr Ala Val Tyr Asn Gly Glu Val Leu His Phe 290 295 300 His Thr Glu Asn Gly Arg Pro Tyr Pro Thr Arg Gly Arg Phe Ala Ala 305 310 315 320 Lys Val Asp Phe Gly Ser Lys Ser Val Asp Gly Ile Ile Asp Ser Gly 325 330 335 Asp Asp Leu His Met Gly Thr Gln Lys Phe Lys Ala Ala Ile Asp Gly 340 345 350 Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val Ser 355 360 365 Gly Arg Phe Tyr Gly 
Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr Ser 370 375 380 Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala Gly 385 390 395 400 Lys Lys Glu Gln Asp Gly Ser Gly Gly Gly Gly Cys Gln Ser Lys Ser 405 410 415 Ile Gln Thr Phe Pro Gln Pro Asp Thr Ser Val Ile Asn Gly Pro Asp 420 425 430 Arg Pro Val Gly Ile Pro Asp Pro Ala Gly Thr Thr Val Gly Gly Gly 435 440 445 Gly Ala Val Tyr Thr Val Val Pro His Leu Ser Leu Pro His Trp Ala 450 455 460 Ala Gln Asp Phe Ala Lys Ser Leu Gln Ser Phe Arg Leu Gly Cys Ala 465 470 475 480 Asn Leu Lys Asn Arg Gln Gly Trp Gln Asp Val Cys Ala Gln Ala Phe 485 490 495 Gln Thr Pro Val His Ser Phe Gln Ala Lys Gln Phe Phe Glu Arg Tyr 500 505 510 Phe Thr Pro Trp Gln Val Ala Gly Asn Gly Ser Leu Ala Gly Thr Val 515 520 525 Thr Gly Tyr Tyr Glu Pro Val Leu Lys Gly Asp Asp Arg Arg Thr Ala 530 535 540 Gln Ala Arg Phe Pro Ile Tyr Gly Ile Pro Asp Asp Phe Ile Ser Val 545 550 555 560 Pro Leu Pro Ala Gly Leu Arg Ser Gly Lys Ala Leu Val Arg Ile Arg 565 570 575 Gln Thr Gly Lys Asn Ser Gly Thr Ile Asp Asn Thr Gly Gly Thr His 580 585 590 Thr Ala Asp Leu Ser Arg Phe Pro Ile Thr Ala Arg Thr Thr Ala Ile 595 600 605 Lys Gly Arg Phe Glu Gly Ser Arg Phe Leu Pro Tyr His Thr Arg Asn 610 615 620 Gln Ile Asn Gly Gly Ala Leu Asp Gly Lys Ala Pro Ile Leu Gly Tyr 625 630 635 640 Ala Glu Asp Pro Val Glu Leu Phe Phe Met His Ile Gln Gly Ser Gly 645 650 655 Arg Leu Lys Thr Pro Ser Gly Lys Tyr Ile Arg Ile Gly Tyr Ala Asp 660 665 670 Lys Asn Glu His Pro Tyr Val Ser Ile Gly Arg Tyr Met Ala Asp Lys 675 680 685 Gly Tyr Leu Lys Leu Gly Gln Thr Ser Met Gln Gly Ile Lys Ala Tyr 690 695 700 Met Arg Gln Asn Pro Gln Arg Leu Ala Glu Val Leu Gly Gln Asn Pro 705 710 715 720 Ser Tyr Ile Phe Phe Arg Glu Leu Ala Gly Ser Ser Asn Asp Gly Pro 725 730 735 Val Gly Ala Leu Gly Thr Pro Leu Met Gly Glu Tyr Ala Gly Ala Val 740 745 750 Asp Arg His Tyr Ile Thr Leu Gly Ala Pro Leu Phe Val Ala Thr Ala 755 760 765 His Pro Val Thr Arg Lys Ala Leu Asn Arg Leu Ile Met Ala Gln Asp 770 775 780 Thr Gly Ser Ala Ile Lys 
Gly Ala Val Arg Val Asp Tyr Phe Trp Gly 785 790 795 800 Tyr Gly Asp Glu Ala Gly Glu Leu Ala Gly Lys Gln Lys Thr Thr Gly 805 810 815 Tyr Val Trp Gln Leu Leu Pro Asn Gly Met Lys Pro Glu Tyr Arg Pro 820 825 830 5 1746 DNA Artificial Sequence deltaG287-953 5 atggctagcc ccgatgttaa atcggcggac acgctgtcaa aaccggccgc tcctgttgtt 60 gctgaaaaag agacagaggt aaaagaagat gcgccacagg caggttctca aggacagggc 120 gcgccatcca cacaaggcag ccaagatatg gcggcagttt cggcagaaaa tacaggcaat 180 ggcggtgcgg caacaacgga caaacccaaa aatgaagacg agggaccgca aaatgatatg 240 ccgcaaaatt ccgccgaatc cgcaaatcaa acagggaaca accaacccgc cgattcttca 300 gattccgccc ccgcgtcaaa ccctgcacct gcgaatggcg gtagcaattt tggaagggtt 360 gatttggcta atggcgtttt gattgatggg ccgtcgcaaa atataacgtt gacccactgt 420 aaaggcgatt cttgtaatgg tgataattta ttggatgaag aagcaccgtc aaaatcagaa 480 tttgaaaatt taaatgagtc tgaacgaatt gagaaatata agaaagatgg gaaaagcgat 540 aaatttacta atttggttgc gacagcagtt caagctaatg gaactaacaa atatgtcatc 600 atttataaag acaagtccgc ttcatcttca tctgcgcgat tcaggcgttc tgcacggtcg 660 aggaggtcgc ttcctgccga gatgccgcta atccccgtca atcaggcgga tacgctgatt 720 gtcgatgggg aagcggtcag cctgacgggg cattccggca atatcttcgc gcccgaaggg 780 aattaccggt atctgactta cggggcggaa aaattgcccg gcggatcgta tgccctccgt 840 gtgcaaggcg aaccggcaaa aggcgaaatg cttgctggca cggccgtgta caacggcgaa 900 gtgctgcatt ttcatacgga aaacggccgt ccgtacccga ctagaggcag gtttgccgca 960 aaagtcgatt tcggcagcaa atctgtggac ggcattatcg acagcggcga tgatttgcat 1020 atgggtacgc aaaaattcaa agccgccatc gatggaaacg gctttaaggg gacttggacg 1080 gaaaatggcg gcggggatgt ttccggaagg ttttacggcc cggccggcga ggaagtggcg 1140 ggaaaataca gctatcgccc gacagatgcg gaaaagggcg gattcggcgt gtttgccggc 1200 aaaaaagagc aggatggatc cggaggagga ggagccacct acaaagtgga cgaatatcac 1260 gccaacgccc gtttcgccat cgaccatttc aacaccagca ccaacgtcgg cggtttttac 1320 ggtctgaccg gttccgtcga gttcgaccaa gcaaaacgcg acggtaaaat cgacatcacc 1380 atccccgttg ccaacctgca aagcggttcg caacacttta ccgaccacct gaaatcagcc 1440 gacatcttcg atgccgccca atatccggac atccgctttg tttccaccaa 
attcaacttc 1500 aacggcaaaa aactggtttc cgttgacggc aacctgacca tgcacggcaa aaccgccccc 1560 gtcaaactca aagccgaaaa attcaactgc taccaaagcc cgatggcgaa aaccgaagtt 1620 tgcggcggcg acttcagcac caccatcgac cgcaccaaat ggggcgtgga ctacctcgtt 1680 aacgttggta tgaccaaaag cgtccgcatc gacatccaaa tcgaggcagc caaacaataa 1740 ctcgag 1746 6 579 PRT Artificial Sequence deltaG287-953 6 Met Ala Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala 1 5 10 15 Ala Pro Val Val Ala Glu Lys Glu Thr Glu Val Lys Glu Asp Ala Pro 20 25 30 Gln Ala Gly Ser Gln Gly Gln Gly Ala Pro Ser Thr Gln Gly Ser Gln 35 40 45 Asp Met Ala Ala Val Ser Ala Glu Asn Thr Gly Asn Gly Gly Ala Ala 50 55 60 Thr Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Pro Gln Asn Asp Met 65 70 75 80 Pro Gln Asn Ser Ala Glu Ser Ala Asn Gln Thr Gly Asn Asn Gln Pro 85 90 95 Ala Asp Ser Ser Asp Ser Ala Pro Ala Ser Asn Pro Ala Pro Ala Asn 100 105 110 Gly Gly Ser Asn Phe Gly Arg Val Asp Leu Ala Asn Gly Val Leu Ile 115 120 125 Asp Gly Pro Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser 130 135 140 Cys Asn Gly Asp Asn Leu Leu Asp Glu Glu Ala Pro Ser Lys Ser Glu 145 150 155 160 Phe Glu Asn Leu Asn Glu Ser Glu Arg Ile Glu Lys Tyr Lys Lys Asp 165 170 175 Gly Lys Ser Asp Lys Phe Thr Asn Leu Val Ala Thr Ala Val Gln Ala 180 185 190 Asn Gly Thr Asn Lys Tyr Val Ile Ile Tyr Lys Asp Lys Ser Ala Ser 195 200 205 Ser Ser Ser Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser Leu 210 215 220 Pro Ala Glu Met Pro Leu Ile Pro Val Asn Gln Ala Asp Thr Leu Ile 225 230 235 240 Val Asp Gly Glu Ala Val Ser Leu Thr Gly His Ser Gly Asn Ile Phe 245 250 255 Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys Leu 260 265 270 Pro Gly Gly Ser Tyr Ala Leu Arg Val Gln Gly Glu Pro Ala Lys Gly 275 280 285 Glu Met Leu Ala Gly Thr Ala Val Tyr Asn Gly Glu Val Leu His Phe 290 295 300 His Thr Glu Asn Gly Arg Pro Tyr Pro Thr Arg Gly Arg Phe Ala Ala 305 310 315 320 Lys Val Asp Phe Gly Ser Lys Ser Val Asp Gly Ile Ile Asp Ser Gly 325 330 335 Asp Asp Leu His Met Gly Thr Gln Lys Phe Lys Ala 
Ala Ile Asp Gly 340 345 350 Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val Ser 355 360 365 Gly Arg Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr Ser 370 375 380 Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala Gly 385 390 395 400 Lys Lys Glu Gln Asp Gly Ser Gly Gly Gly Gly Ala Thr Tyr Lys Val 405 410 415 Asp Glu Tyr His Ala Asn Ala Arg Phe Ala Ile Asp His Phe Asn Thr 420 425 430 Ser Thr Asn Val Gly Gly Phe Tyr Gly Leu Thr Gly Ser Val Glu Phe 435 440 445 Asp Gln Ala Lys Arg Asp Gly Lys Ile Asp Ile Thr Ile Pro Val Ala 450 455 460 Asn Leu Gln Ser Gly Ser Gln His Phe Thr Asp His Leu Lys Ser Ala 465 470 475 480 Asp Ile Phe Asp Ala Ala Gln Tyr Pro Asp Ile Arg Phe Val Ser Thr 485 490 495 Lys Phe Asn Phe Asn Gly Lys Lys Leu Val Ser Val Asp Gly Asn Leu 500 505 510 Thr Met His Gly Lys Thr Ala Pro Val Lys Leu Lys Ala Glu Lys Phe 515 520 525 Asn Cys Tyr Gln Ser Pro Met Ala Lys Thr Glu Val Cys Gly Gly Asp 530 535 540 Phe Ser Thr Thr Ile Asp Arg Thr Lys Trp Gly Val Asp Tyr Leu Val 545 550 555 560 Asn Val Gly Met Thr Lys Ser Val Arg Ile Asp Ile Gln Ile Glu Ala 565 570 575 Ala Lys Gln 7 2388 DNA Artificial Sequence deltaG287-961 7 atggctagcc ccgatgttaa atcggcggac acgctgtcaa aaccggccgc tcctgttgtt 60 gctgaaaaag agacagaggt aaaagaagat gcgccacagg caggttctca aggacagggc 120 gcgccatcca cacaaggcag ccaagatatg gcggcagttt cggcagaaaa tacaggcaat 180 ggcggtgcgg caacaacgga caaacccaaa aatgaagacg agggaccgca aaatgatatg 240 ccgcaaaatt ccgccgaatc cgcaaatcaa acagggaaca accaacccgc cgattcttca 300 gattccgccc ccgcgtcaaa ccctgcacct gcgaatggcg gtagcaattt tggaagggtt 360 gatttggcta atggcgtttt gattgatggg ccgtcgcaaa atataacgtt gacccactgt 420 aaaggcgatt cttgtaatgg tgataattta ttggatgaag aagcaccgtc aaaatcagaa 480 tttgaaaatt taaatgagtc tgaacgaatt gagaaatata agaaagatgg gaaaagcgat 540 aaatttacta atttggttgc gacagcagtt caagctaatg gaactaacaa atatgtcatc 600 atttataaag acaagtccgc ttcatcttca tctgcgcgat tcaggcgttc tgcacggtcg 660 aggaggtcgc ttcctgccga gatgccgcta atccccgtca atcaggcgga tacgctgatt 
720 gtcgatgggg aagcggtcag cctgacgggg cattccggca atatcttcgc gcccgaaggg 780 aattaccggt atctgactta cggggcggaa aaattgcccg gcggatcgta tgccctccgt 840 gtgcaaggcg aaccggcaaa aggcgaaatg cttgctggca cggccgtgta caacggcgaa 900 gtgctgcatt ttcatacgga aaacggccgt ccgtacccga ctagaggcag gtttgccgca 960 aaagtcgatt tcggcagcaa atctgtggac ggcattatcg acagcggcga tgatttgcat 1020 atgggtacgc aaaaattcaa agccgccatc gatggaaacg gctttaaggg gacttggacg 1080 gaaaatggcg gcggggatgt ttccggaagg ttttacggcc cggccggcga ggaagtggcg 1140 ggaaaataca gctatcgccc gacagatgcg gaaaagggcg gattcggcgt gtttgccggc 1200 aaaaaagagc aggatggatc cggaggagga ggagccacaa acgacgacga tgttaaaaaa 1260 gctgccactg tggccattgc tgctgcctac aacaatggcc aagaaatcaa cggtttcaaa 1320 gctggagaga ccatctacga cattgatgaa gacggcacaa ttaccaaaaa agacgcaact 1380 gcagccgatg ttgaagccga cgactttaaa ggtctgggtc tgaaaaaagt cgtgactaac 1440 ctgaccaaaa ccgtcaatga aaacaaacaa aacgtcgatg ccaaagtaaa agctgcagaa 1500 tctgaaatag aaaagttaac aaccaagtta gcagacactg atgccgcttt agcagatact 1560 gatgccgctc tggatgcaac caccaacgcc ttgaataaat tgggagaaaa tataacgaca 1620 tttgctgaag agactaagac aaatatcgta aaaattgatg aaaaattaga agccgtggct 1680 gataccgtcg acaagcatgc cgaagcattc aacgatatcg ccgattcatt ggatgaaacc 1740 aacactaagg cagacgaagc cgtcaaaacc gccaatgaag ccaaacagac ggccgaagaa 1800 accaaacaaa acgtcgatgc caaagtaaaa gctgcagaaa ctgcagcagg caaagccgaa 1860 gctgccgctg gcacagctaa tactgcagcc gacaaggccg aagctgtcgc tgcaaaagtt 1920 accgacatca aagctgatat cgctacgaac aaagataata ttgctaaaaa agcaaacagt 1980 gccgacgtgt acaccagaga agagtctgac agcaaatttg tcagaattga tggtctgaac 2040 gctactaccg aaaaattgga cacacgcttg gcttctgctg aaaaatccat tgccgatcac 2100 gatactcgcc tgaacggttt ggataaaaca gtgtcagacc tgcgcaaaga aacccgccaa 2160 ggccttgcag aacaagccgc gctctccggt ctgttccaac cttacaacgt gggtcggttc 2220 aatgtaacgg ctgcagtcgg cggctacaaa tccgaatcgg cagtcgccat cggtaccggc 2280 ttccgcttta ccgaaaactt tgccgccaaa gcaggcgtgg cagtcggcac ttcgtccggt 2340 tcttccgcag cctaccatgt cggcgtcaat tacgagtggt aactcgag 2388 8 793 PRT Artificial 
Sequence deltaG287-961 8 Met Ala Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala 1 5 10 15 Ala Pro Val Val Ala Glu Lys Glu Thr Glu Val Lys Glu Asp Ala Pro 20 25 30 Gln Ala Gly Ser Gln Gly Gln Gly Ala Pro Ser Thr Gln Gly Ser Gln 35 40 45 Asp Met Ala Ala Val Ser Ala Glu Asn Thr Gly Asn Gly Gly Ala Ala 50 55 60 Thr Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Pro Gln Asn Asp Met 65 70 75 80 Pro Gln Asn Ser Ala Glu Ser Ala Asn Gln Thr Gly Asn Asn Gln Pro 85 90 95 Ala Asp Ser Ser Asp Ser Ala Pro Ala Ser Asn Pro Ala Pro Ala Asn 100 105 110 Gly Gly Ser Asn Phe Gly Arg Val Asp Leu Ala Asn Gly Val Leu Ile 115 120 125 Asp Gly Pro Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser 130 135 140 Cys Asn Gly Asp Asn Leu Leu Asp Glu Glu Ala Pro Ser Lys Ser Glu 145 150 155 160 Phe Glu Asn Leu Asn Glu Ser Glu Arg Ile Glu Lys Tyr Lys Lys Asp 165 170 175 Gly Lys Ser Asp Lys Phe Thr Asn Leu Val Ala Thr Ala Val Gln Ala 180 185 190 Asn Gly Thr Asn Lys Tyr Val Ile Ile Tyr Lys Asp Lys Ser Ala Ser 195 200 205 Ser Ser Ser Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser Leu 210 215 220 Pro Ala Glu Met Pro Leu Ile Pro Val Asn Gln Ala Asp Thr Leu Ile 225 230 235 240 Val Asp Gly Glu Ala Val Ser Leu Thr Gly His Ser Gly Asn Ile Phe 245 250 255 Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys Leu 260 265 270 Pro Gly Gly Ser Tyr Ala Leu Arg Val Gln Gly Glu Pro Ala Lys Gly 275 280 285 Glu Met Leu Ala Gly Thr Ala Val Tyr Asn Gly Glu Val Leu His Phe 290 295 300 His Thr Glu Asn Gly Arg Pro Tyr Pro Thr Arg Gly Arg Phe Ala Ala 305 310 315 320 Lys Val Asp Phe Gly Ser Lys Ser Val Asp Gly Ile Ile Asp Ser Gly 325 330 335 Asp Asp Leu His Met Gly Thr Gln Lys Phe Lys Ala Ala Ile Asp Gly 340 345 350 Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val Ser 355 360 365 Gly Arg Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr Ser 370 375 380 Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala Gly 385 390 395 400 Lys Lys Glu Gln Asp Gly Ser Gly Gly Gly Gly Ala Thr Asn Asp Asp 405 410 
415 Asp Val Lys Lys Ala Ala Thr Val Ala Ile Ala Ala Ala Tyr Asn Asn 420 425 430 Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly Glu Thr Ile Tyr Asp Ile 435 440 445 Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp Ala Thr Ala Ala Asp Val 450 455 460 Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu Lys Lys Val Val Thr Asn 465 470 475 480 Leu Thr Lys Thr Val Asn Glu Asn Lys Gln Asn Val Asp Ala Lys Val 485 490 495 Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu Thr Thr Lys Leu Ala Asp 500 505 510 Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr Thr 515 520 525 Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile Thr Thr Phe Ala Glu Glu 530 535 540 Thr Lys Thr Asn Ile Val Lys Ile Asp Glu Lys Leu Glu Ala Val Ala 545 550 555 560 Asp Thr Val Asp Lys His Ala Glu Ala Phe Asn Asp Ile Ala Asp Ser 565 570 575 Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu Ala Val Lys Thr Ala Asn 580 585 590 Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys Gln Asn Val Asp Ala Lys 595 600 605 Val Lys Ala Ala Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala Gly 610 615 620 Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu Ala Val Ala Ala Lys Val 625 630 635 640 Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn Lys Asp Asn Ile Ala Lys 645 650 655 Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg Glu Glu Ser Asp Ser Lys 660 665 670 Phe Val Arg Ile Asp Gly Leu Asn Ala Thr Thr Glu Lys Leu Asp Thr 675 680 685 Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala Asp His Asp Thr Arg Leu 690 695 700 Asn Gly Leu Asp Lys Thr Val Ser Asp Leu Arg Lys Glu Thr Arg Gln 705 710 715 720 Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly Leu Phe Gln Pro Tyr Asn 725 730 735 Val Gly Arg Phe Asn Val Thr Ala Ala Val Gly Gly Tyr Lys Ser Glu 740 745 750 Ser Ala Val Ala Ile Gly Thr Gly Phe Arg Phe Thr Glu Asn Phe Ala 755 760 765 Ala Lys Ala Gly Val Ala Val Gly Thr Ser Ser Gly Ser Ser Ala Ala 770 775 780 Tyr His Val Gly Val Asn Tyr Glu Trp 785 790 9 2700 DNA Artificial Sequence deltaG287NZ-919 9 atggctagcc ccgatgtcaa gtcggcggac acgctgtcaa aacctgccgc ccctgttgtt 60 tctgaaaaag agacagaggc aaaggaagat gcgccacagg caggttctca aggacagggc 120 
gcgccatccg cacaaggcgg tcaagatatg gcggcggttt cggaagaaaa tacaggcaat 180 ggcggtgcgg cagcaacgga caaacccaaa aatgaagacg agggggcgca aaatgatatg 240 ccgcaaaatg ccgccgatac agatagtttg acaccgaatc acaccccggc ttcgaatatg 300 ccggccggaa atatggaaaa ccaagcaccg gatgccgggg aatcggagca gccggcaaac 360 caaccggata tggcaaatac ggcggacgga atgcagggtg acgatccgtc ggcaggcggg 420 gaaaatgccg gcaatacggc tgcccaaggt acaaatcaag ccgaaaacaa tcaaaccgcc 480 ggttctcaaa atcctgcctc ttcaaccaat cctagcgcca cgaatagcgg tggtgatttt 540 ggaaggacga acgtgggcaa ttctgttgtg attgacgggc cgtcgcaaaa tataacgttg 600 acccactgta aaggcgattc ttgtagtggc aataatttct tggatgaaga agtacagcta 660 aaatcagaat ttgaaaaatt aagtgatgca gacaaaataa gtaattacaa gaaagatggg 720 aagaatgacg ggaagaatga taaatttgtc ggtttggttg ccgatagtgt gcagatgaag 780 ggaatcaatc aatatattat cttttataaa cctaaaccca cttcatttgc gcgatttagg 840 cgttctgcac ggtcgaggcg gtcgcttccg gccgagatgc cgctgattcc cgtcaatcag 900 gcggatacgc tgattgtcga tggggaagcg gtcagcctga cggggcattc cggcaatatc 960 ttcgcgcccg aagggaatta ccggtatctg acttacgggg cggaaaaatt gcccggcgga 1020 tcgtatgccc tccgtgttca aggcgaacct tcaaaaggcg aaatgctcgc gggcacggca 1080 gtgtacaacg gcgaagtgct gcattttcat acggaaaacg gccgtccgtc cccgtccaga 1140 ggcaggtttg ccgcaaaagt cgatttcggc agcaaatctg tggacggcat tatcgacagc 1200 ggcgatggtt tgcatatggg tacgcaaaaa ttcaaagccg ccatcgatgg aaacggcttt 1260 aaggggactt ggacggaaaa tggcggcggg gatgtttccg gaaagtttta cggcccggcc 1320 ggcgaggaag tggcgggaaa atacagctat cgcccaacag atgcggaaaa gggcggattc 1380 ggcgtgtttg ccggcaaaaa agagcaggat ggatccggag gaggaggatg ccaaagcaag 1440 agcatccaaa cctttccgca acccgacaca tccgtcatca acggcccgga ccggccggtc 1500 ggcatccccg accccgccgg aacgacggtc ggcggcggcg gggccgtcta taccgttgta 1560 ccgcacctgt ccctgcccca ctgggcggcg caggatttcg ccaaaagcct gcaatccttc 1620 cgcctcggct gcgccaattt gaaaaaccgc caaggctggc aggatgtgtg cgcccaagcc 1680 tttcaaaccc ccgtccattc ctttcaggca aaacagtttt ttgaacgcta tttcacgccg 1740 tggcaggttg caggcaacgg aagccttgcc ggtacggtta ccggctatta cgagccggtg 1800 ctgaagggcg acgacaggcg 
gacggcacaa gcccgcttcc cgatttacgg tattcccgac 1860 gattttatct ccgtccccct gcctgccggt ttgcggagcg gaaaagccct tgtccgcatc 1920 aggcagacgg gaaaaaacag cggcacaatc gacaataccg gcggcacaca taccgccgac 1980 ctctcccgat tccccatcac cgcgcgcaca acggcaatca aaggcaggtt tgaaggaagc 2040 cgcttcctcc cctaccacac gcgcaaccaa atcaacggcg gcgcgcttga cggcaaagcc 2100 ccgatactcg gttacgccga agaccccgtc gaactttttt ttatgcacat ccaaggctcg 2160 ggccgtctga aaaccccgtc cggcaaatac atccgcatcg gctatgccga caaaaacgaa 2220 catccctacg tttccatcgg acgctatatg gcggacaaag gctacctcaa gctcgggcag 2280 acctcgatgc agggcatcaa agcctatatg cggcaaaatc cgcaacgcct cgccgaagtt 2340 ttgggtcaaa accccagcta tatctttttc cgcgagcttg ccggaagcag caatgacggt 2400 cccgtcggcg cactgggcac gccgttgatg ggggaatatg ccggcgcagt cgaccggcac 2460 tacattacct tgggcgcgcc cttatttgtc gccaccgccc atccggttac ccgcaaagcc 2520 ctcaaccgcc tgattatggc gcaggatacc ggcagcgcga ttaaaggcgc ggtgcgcgtg 2580 gattattttt ggggatacgg cgacgaagcc ggcgaacttg ccggcaaaca gaaaaccacg 2640 ggttacgtct ggcagctcct acccaacggt atgaagcccg aataccgccc gtaaaagctt 2700 10 897 PRT Artificial Sequence deltaG287NZ-919 10 Met Ala Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala 1 5 10 15 Ala Pro Val Val Ser Glu Lys Glu Thr Glu Ala Lys Glu Asp Ala Pro 20 25 30 Gln Ala Gly Ser Gln Gly Gln Gly Ala Pro Ser Ala Gln Gly Gly Gln 35 40 45 Asp Met Ala Ala Val Ser Glu Glu Asn Thr Gly Asn Gly Gly Ala Ala 50 55 60 Ala Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Ala Gln Asn Asp Met 65 70 75 80 Pro Gln Asn Ala Ala Asp Thr Asp Ser Leu Thr Pro Asn His Thr Pro 85 90 95 Ala Ser Asn Met Pro Ala Gly Asn Met Glu Asn Gln Ala Pro Asp Ala 100 105 110 Gly Glu Ser Glu Gln Pro Ala Asn Gln Pro Asp Met Ala Asn Thr Ala 115 120 125 Asp Gly Met Gln Gly Asp Asp Pro Ser Ala Gly Gly Glu Asn Ala Gly 130 135 140 Asn Thr Ala Ala Gln Gly Thr Asn Gln Ala Glu Asn Asn Gln Thr Ala 145 150 155 160 Gly Ser Gln Asn Pro Ala Ser Ser Thr Asn Pro Ser Ala Thr Asn Ser 165 170 175 Gly Gly Asp Phe Gly Arg Thr Asn Val Gly Asn Ser Val Val Ile Asp 180 185 190 Gly 
Pro Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser Cys 195 200 205 Ser Gly Asn Asn Phe Leu Asp Glu Glu Val Gln Leu Lys Ser Glu Phe 210 215 220 Glu Lys Leu Ser Asp Ala Asp Lys Ile Ser Asn Tyr Lys Lys Asp Gly 225 230 235 240 Lys Asn Asp Gly Lys Asn Asp Lys Phe Val Gly Leu Val Ala Asp Ser 245 250 255 Val Gln Met Lys Gly Ile Asn Gln Tyr Ile Ile Phe Tyr Lys Pro Lys 260 265 270 Pro Thr Ser Phe Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser 275 280 285 Leu Pro Ala Glu Met Pro Leu Ile Pro Val Asn Gln Ala Asp Thr Leu 290 295 300 Ile Val Asp Gly Glu Ala Val Ser Leu Thr Gly His Ser Gly Asn Ile 305 310 315 320 Phe Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys 325 330 335 Leu Pro Gly Gly Ser Tyr Ala Leu Arg Val Gln Gly Glu Pro Ser Lys 340 345 350 Gly Glu Met Leu Ala Gly Thr Ala Val Tyr Asn Gly Glu Val Leu His 355 360 365 Phe His Thr Glu Asn Gly Arg Pro Ser Pro Ser Arg Gly Arg Phe Ala 370 375 380 Ala Lys Val Asp Phe Gly Ser Lys Ser Val Asp Gly Ile Ile Asp Ser 385 390 395 400 Gly Asp Gly Leu His Met Gly Thr Gln Lys Phe Lys Ala Ala Ile Asp 405 410 415 Gly Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val 420 425 430 Ser Gly Lys Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr 435 440 445 Ser Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala 450 455 460 Gly Lys Lys Glu Gln Asp Gly Ser Gly Gly Gly Gly Cys Gln Ser Lys 465 470 475 480 Ser Ile Gln Thr Phe Pro Gln Pro Asp Thr Ser Val Ile Asn Gly Pro 485 490 495 Asp Arg Pro Val Gly Ile Pro Asp Pro Ala Gly Thr Thr Val Gly Gly 500 505 510 Gly Gly Ala Val Tyr Thr Val Val Pro His Leu Ser Leu Pro His Trp 515 520 525 Ala Ala Gln Asp Phe Ala Lys Ser Leu Gln Ser Phe Arg Leu Gly Cys 530 535 540 Ala Asn Leu Lys Asn Arg Gln Gly Trp Gln Asp Val Cys Ala Gln Ala 545 550 555 560 Phe Gln Thr Pro Val His Ser Phe Gln Ala Lys Gln Phe Phe Glu Arg 565 570 575 Tyr Phe Thr Pro Trp Gln Val Ala Gly Asn Gly Ser Leu Ala Gly Thr 580 585 590 Val Thr Gly Tyr Tyr Glu Pro Val Leu Lys Gly Asp Asp Arg Arg Thr 595 600 605 Ala Gln 
Ala Arg Phe Pro Ile Tyr Gly Ile Pro Asp Asp Phe Ile Ser 610 615 620 Val Pro Leu Pro Ala Gly Leu Arg Ser Gly Lys Ala Leu Val Arg Ile 625 630 635 640 Arg Gln Thr Gly Lys Asn Ser Gly Thr Ile Asp Asn Thr Gly Gly Thr 645 650 655 His Thr Ala Asp Leu Ser Arg Phe Pro Ile Thr Ala Arg Thr Thr Ala 660 665 670 Ile Lys Gly Arg Phe Glu Gly Ser Arg Phe Leu Pro Tyr His Thr Arg 675 680 685 Asn Gln Ile Asn Gly Gly Ala Leu Asp Gly Lys Ala Pro Ile Leu Gly 690 695 700 Tyr Ala Glu Asp Pro Val Glu Leu Phe Phe Met His Ile Gln Gly Ser 705 710 715 720 Gly Arg Leu Lys Thr Pro Ser Gly Lys Tyr Ile Arg Ile Gly Tyr Ala 725 730 735 Asp Lys Asn Glu His Pro Tyr Val Ser Ile Gly Arg Tyr Met Ala Asp 740 745 750 Lys Gly Tyr Leu Lys Leu Gly Gln Thr Ser Met Gln Gly Ile Lys Ala 755 760 765 Tyr Met Arg Gln Asn Pro Gln Arg Leu Ala Glu Val Leu Gly Gln Asn 770 775 780 Pro Ser Tyr Ile Phe Phe Arg Glu Leu Ala Gly Ser Ser Asn Asp Gly 785 790 795 800 Pro Val Gly Ala Leu Gly Thr Pro Leu Met Gly Glu Tyr Ala Gly Ala 805 810 815 Val Asp Arg His Tyr Ile Thr Leu Gly Ala Pro Leu Phe Val Ala Thr 820 825 830 Ala His Pro Val Thr Arg Lys Ala Leu Asn Arg Leu Ile Met Ala Gln 835 840 845 Asp Thr Gly Ser Ala Ile Lys Gly Ala Val Arg Val Asp Tyr Phe Trp 850 855 860 Gly Tyr Gly Asp Glu Ala Gly Glu Leu Ala Gly Lys Gln Lys Thr Thr 865 870 875 880 Gly Tyr Val Trp Gln Leu Leu Pro Asn Gly Met Lys Pro Glu Tyr Arg 885 890 895 Pro 11 1941 DNA Artificial Sequence deltaG287NZ-953 11 atggctagcc ccgatgtcaa gtcggcggac acgctgtcaa aacctgccgc ccctgttgtt 60 tctgaaaaag agacagaggc aaaggaagat gcgccacagg caggttctca aggacagggc 120 gcgccatccg cacaaggcgg tcaagatatg gcggcggttt cggaagaaaa tacaggcaat 180 ggcggtgcgg cagcaacgga caaacccaaa aatgaagacg agggggcgca aaatgatatg 240 ccgcaaaatg ccgccgatac agatagtttg acaccgaatc acaccccggc ttcgaatatg 300 ccggccggaa atatggaaaa ccaagcaccg gatgccgggg aatcggagca gccggcaaac 360 caaccggata tggcaaatac ggcggacgga atgcagggtg acgatccgtc ggcaggcggg 420 gaaaatgccg gcaatacggc tgcccaaggt acaaatcaag ccgaaaacaa tcaaaccgcc 480 
ggttctcaaa atcctgcctc ttcaaccaat cctagcgcca cgaatagcgg tggtgatttt 540 ggaaggacga acgtgggcaa ttctgttgtg attgacgggc cgtcgcaaaa tataacgttg 600 acccactgta aaggcgattc ttgtagtggc aataatttct tggatgaaga agtacagcta 660 aaatcagaat ttgaaaaatt aagtgatgca gacaaaataa gtaattacaa gaaagatggg 720 aagaatgacg ggaagaatga taaatttgtc ggtttggttg ccgatagtgt gcagatgaag 780 ggaatcaatc aatatattat cttttataaa cctaaaccca cttcatttgc gcgatttagg 840 cgttctgcac ggtcgaggcg gtcgcttccg gccgagatgc cgctgattcc cgtcaatcag 900 gcggatacgc tgattgtcga tggggaagcg gtcagcctga cggggcattc cggcaatatc 960 ttcgcgcccg aagggaatta ccggtatctg acttacgggg cggaaaaatt gcccggcgga 1020 tcgtatgccc tccgtgttca aggcgaacct tcaaaaggcg aaatgctcgc gggcacggca 1080 gtgtacaacg gcgaagtgct gcattttcat acggaaaacg gccgtccgtc cccgtccaga 1140 ggcaggtttg ccgcaaaagt cgatttcggc agcaaatctg tggacggcat tatcgacagc 1200 ggcgatggtt tgcatatggg tacgcaaaaa ttcaaagccg ccatcgatgg aaacggcttt 1260 aaggggactt ggacggaaaa tggcggcggg gatgtttccg gaaagtttta cggcccggcc 1320 ggcgaggaag tggcgggaaa atacagctat cgcccaacag atgcggaaaa gggcggattc 1380 ggcgtgtttg ccggcaaaaa agagcaggat ggatccggag gaggaggagc cacctacaaa 1440 gtggacgaat atcacgccaa cgcccgtttc gccatcgacc atttcaacac cagcaccaac 1500 gtcggcggtt tttacggtct gaccggttcc gtcgagttcg accaagcaaa acgcgacggt 1560 aaaatcgaca tcaccatccc cgttgccaac ctgcaaagcg gttcgcaaca ctttaccgac 1620 cacctgaaat cagccgacat cttcgatgcc gcccaatatc cggacatccg ctttgtttcc 1680 accaaattca acttcaacgg caaaaaactg gtttccgttg acggcaacct gaccatgcac 1740 ggcaaaaccg cccccgtcaa actcaaagcc gaaaaattca actgctacca aagcccgatg 1800 gcgaaaaccg aagtttgcgg cggcgacttc agcaccacca tcgaccgcac caaatggggc 1860 gtggactacc tcgttaacgt tggtatgacc aaaagcgtcc gcatcgacat ccaaatcgag 1920 gcagccaaac aataaaagct t 1941 12 644 PRT Artificial Sequence deltaG287NZ-953 12 Met Ala Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala 1 5 10 15 Ala Pro Val Val Ser Glu Lys Glu Thr Glu Ala Lys Glu Asp Ala Pro 20 25 30 Gln Ala Gly Ser Gln Gly Gln Gly Ala Pro Ser Ala Gln Gly Gly Gln 35 40 45 Asp 
Met Ala Ala Val Ser Glu Glu Asn Thr Gly Asn Gly Gly Ala Ala 50 55 60 Ala Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Ala Gln Asn Asp Met 65 70 75 80 Pro Gln Asn Ala Ala Asp Thr Asp Ser Leu Thr Pro Asn His Thr Pro 85 90 95 Ala Ser Asn Met Pro Ala Gly Asn Met Glu Asn Gln Ala Pro Asp Ala 100 105 110 Gly Glu Ser Glu Gln Pro Ala Asn Gln Pro Asp Met Ala Asn Thr Ala 115 120 125 Asp Gly Met Gln Gly Asp Asp Pro Ser Ala Gly Gly Glu Asn Ala Gly 130 135 140 Asn Thr Ala Ala Gln Gly Thr Asn Gln Ala Glu Asn Asn Gln Thr Ala 145 150 155 160 Gly Ser Gln Asn Pro Ala Ser Ser Thr Asn Pro Ser Ala Thr Asn Ser 165 170 175 Gly Gly Asp Phe Gly Arg Thr Asn Val Gly Asn Ser Val Val Ile Asp 180 185 190 Gly Pro Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser Cys 195 200 205 Ser Gly Asn Asn Phe Leu Asp Glu Glu Val Gln Leu Lys Ser Glu Phe 210 215 220 Glu Lys Leu Ser Asp Ala Asp Lys Ile Ser Asn Tyr Lys Lys Asp Gly 225 230 235 240 Lys Asn Asp Gly Lys Asn Asp Lys Phe Val Gly Leu Val Ala Asp Ser 245 250 255 Val Gln Met Lys Gly Ile Asn Gln Tyr Ile Ile Phe Tyr Lys Pro Lys 260 265 270 Pro Thr Ser Phe Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser 275 280 285 Leu Pro Ala Glu Met Pro Leu Ile Pro Val Asn Gln Ala Asp Thr Leu 290 295 300 Ile Val Asp Gly Glu Ala Val Ser Leu Thr Gly His Ser Gly Asn Ile 305 310 315 320 Phe Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys 325 330 335 Leu Pro Gly Gly Ser Tyr Ala Leu Arg Val Gln Gly Glu Pro Ser Lys 340 345 350 Gly Glu Met Leu Ala Gly Thr Ala Val Tyr Asn Gly Glu Val Leu His 355 360 365 Phe His Thr Glu Asn Gly Arg Pro Ser Pro Ser Arg Gly Arg Phe Ala 370 375 380 Ala Lys Val Asp Phe Gly Ser Lys Ser Val Asp Gly Ile Ile Asp Ser 385 390 395 400 Gly Asp Gly Leu His Met Gly Thr Gln Lys Phe Lys Ala Ala Ile Asp 405 410 415 Gly Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val 420 425 430 Ser Gly Lys Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr 435 440 445 Ser Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala 450 455 460 Gly Lys Lys Glu 
Gln Asp Gly Ser Gly Gly Gly Gly Ala Thr Tyr Lys 465 470 475 480 Val Asp Glu Tyr His Ala Asn Ala Arg Phe Ala Ile Asp His Phe Asn 485 490 495 Thr Ser Thr Asn Val Gly Gly Phe Tyr Gly Leu Thr Gly Ser Val Glu 500 505 510 Phe Asp Gln Ala Lys Arg Asp Gly Lys Ile Asp Ile Thr Ile Pro Val 515 520 525 Ala Asn Leu Gln Ser Gly Ser Gln His Phe Thr Asp His Leu Lys Ser 530 535 540 Ala Asp Ile Phe Asp Ala Ala Gln Tyr Pro Asp Ile Arg Phe Val Ser 545 550 555 560 Thr Lys Phe Asn Phe Asn Gly Lys Lys Leu Val Ser Val Asp Gly Asn 565 570 575 Leu Thr Met His Gly Lys Thr Ala Pro Val Lys Leu Lys Ala Glu Lys 580 585 590 Phe Asn Cys Tyr Gln Ser Pro Met Ala Lys Thr Glu Val Cys Gly Gly 595 600 605 Asp Phe Ser Thr Thr Ile Asp Arg Thr Lys Trp Gly Val Asp Tyr Leu 610 615 620 Val Asn Val Gly Met Thr Lys Ser Val Arg Ile Asp Ile Gln Ile Glu 625 630 635 640 Ala Ala Lys Gln 13 2583 DNA Artificial Sequence deltaG287NZ-961 13 atggctagcc ccgatgtcaa gtcggcggac acgctgtcaa aacctgccgc ccctgttgtt 60 tctgaaaaag agacagaggc aaaggaagat gcgccacagg caggttctca aggacagggc 120 gcgccatccg cacaaggcgg tcaagatatg gcggcggttt cggaagaaaa tacaggcaat 180 ggcggtgcgg cagcaacgga caaacccaaa aatgaagacg agggggcgca aaatgatatg 240 ccgcaaaatg ccgccgatac agatagtttg acaccgaatc acaccccggc ttcgaatatg 300 ccggccggaa atatggaaaa ccaagcaccg gatgccgggg aatcggagca gccggcaaac 360 caaccggata tggcaaatac ggcggacgga atgcagggtg acgatccgtc ggcaggcggg 420 gaaaatgccg gcaatacggc tgcccaaggt acaaatcaag ccgaaaacaa tcaaaccgcc 480 ggttctcaaa atcctgcctc ttcaaccaat cctagcgcca cgaatagcgg tggtgatttt 540 ggaaggacga acgtgggcaa ttctgttgtg attgacgggc cgtcgcaaaa tataacgttg 600 acccactgta aaggcgattc ttgtagtggc aataatttct tggatgaaga agtacagcta 660 aaatcagaat ttgaaaaatt aagtgatgca gacaaaataa gtaattacaa gaaagatggg 720 aagaatgacg ggaagaatga taaatttgtc ggtttggttg ccgatagtgt gcagatgaag 780 ggaatcaatc aatatattat cttttataaa cctaaaccca cttcatttgc gcgatttagg 840 cgttctgcac ggtcgaggcg gtcgcttccg gccgagatgc cgctgattcc cgtcaatcag 900 gcggatacgc tgattgtcga tggggaagcg gtcagcctga 
cggggcattc cggcaatatc 960 ttcgcgcccg aagggaatta ccggtatctg acttacgggg cggaaaaatt gcccggcgga 1020 tcgtatgccc tccgtgttca aggcgaacct tcaaaaggcg aaatgctcgc gggcacggca 1080 gtgtacaacg gcgaagtgct gcattttcat acggaaaacg gccgtccgtc cccgtccaga 1140 ggcaggtttg ccgcaaaagt cgatttcggc agcaaatctg tggacggcat tatcgacagc 1200 ggcgatggtt tgcatatggg tacgcaaaaa ttcaaagccg ccatcgatgg aaacggcttt 1260 aaggggactt ggacggaaaa tggcggcggg gatgtttccg gaaagtttta cggcccggcc 1320 ggcgaggaag tggcgggaaa atacagctat cgcccaacag atgcggaaaa gggcggattc 1380 ggcgtgtttg ccggcaaaaa agagcaggat ggatccggag gaggaggagc cacaaacgac 1440 gacgatgtta aaaaagctgc cactgtggcc attgctgctg cctacaacaa tggccaagaa 1500 atcaacggtt tcaaagctgg agagaccatc tacgacattg atgaagacgg cacaattacc 1560 aaaaaagacg caactgcagc cgatgttgaa gccgacgact ttaaaggtct gggtctgaaa 1620 aaagtcgtga ctaacctgac caaaaccgtc aatgaaaaca aacaaaacgt cgatgccaaa 1680 gtaaaagctg cagaatctga aatagaaaag ttaacaacca agttagcaga cactgatgcc 1740 gctttagcag atactgatgc cgctctggat gcaaccacca acgccttgaa taaattggga 1800 gaaaatataa cgacatttgc tgaagagact aagacaaata tcgtaaaaat tgatgaaaaa 1860 ttagaagccg tggctgatac cgtcgacaag catgccgaag cattcaacga tatcgccgat 1920 tcattggatg aaaccaacac taaggcagac gaagccgtca aaaccgccaa tgaagccaaa 1980 cagacggccg aagaaaccaa acaaaacgtc gatgccaaag taaaagctgc agaaactgca 2040 gcaggcaaag ccgaagctgc cgctggcaca gctaatactg cagccgacaa ggccgaagct 2100 gtcgctgcaa aagttaccga catcaaagct gatatcgcta cgaacaaaga taatattgct 2160 aaaaaagcaa acagtgccga cgtgtacacc agagaagagt ctgacagcaa atttgtcaga 2220 attgatggtc tgaacgctac taccgaaaaa ttggacacac gcttggcttc tgctgaaaaa 2280 tccattgccg atcacgatac tcgcctgaac ggtttggata aaacagtgtc agacctgcgc 2340 aaagaaaccc gccaaggcct tgcagaacaa gccgcgctct ccggtctgtt ccaaccttac 2400 aacgtgggtc ggttcaatgt aacggctgca gtcggcggct acaaatccga atcggcagtc 2460 gccatcggta ccggcttccg ctttaccgaa aactttgccg ccaaagcagg cgtggcagtc 2520 ggcacttcgt ccggttcttc cgcagcctac catgtcggcg tcaattacga gtggtaaaag 2580 ctt 2583 14 858 PRT Artificial Sequence deltaG287NZ-961 
14 Met Ala Ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala 1 5 10 15 Ala Pro Val Val Ser Glu Lys Glu Thr Glu Ala Lys Glu Asp Ala Pro 20 25 30 Gln Ala Gly Ser Gln Gly Gln Gly Ala Pro Ser Ala Gln Gly Gly Gln 35 40 45 Asp Met Ala Ala Val Ser Glu Glu Asn Thr Gly Asn Gly Gly Ala Ala 50 55 60 Ala Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Ala Gln Asn Asp Met 65 70 75 80 Pro Gln Asn Ala Ala Asp Thr Asp Ser Leu Thr Pro Asn His Thr Pro 85 90 95 Ala Ser Asn Met Pro Ala Gly Asn Met Glu Asn Gln Ala Pro Asp Ala 100 105 110 Gly Glu Ser Glu Gln Pro Ala Asn Gln Pro Asp Met Ala Asn Thr Ala 115 120 125 Asp Gly Met Gln Gly Asp Asp Pro Ser Ala Gly Gly Glu Asn Ala Gly 130 135 140 Asn Thr Ala Ala Gln Gly Thr Asn Gln Ala Glu Asn Asn Gln Thr Ala 145 150 155 160 Gly Ser Gln Asn Pro Ala Ser Ser Thr Asn Pro Ser Ala Thr Asn Ser 165 170 175 Gly Gly Asp Phe Gly Arg Thr Asn Val Gly Asn Ser Val Val Ile Asp 180 185 190 Gly Pro Ser Gln Asn Ile Thr Leu Thr His Cys Lys Gly Asp Ser Cys 195 200 205 Ser Gly Asn Asn Phe Leu Asp Glu Glu Val Gln Leu Lys Ser Glu Phe 210 215 220 Glu Lys Leu Ser Asp Ala Asp Lys Ile Ser Asn Tyr Lys Lys Asp Gly 225 230 235 240 Lys Asn Asp Gly Lys Asn Asp Lys Phe Val Gly Leu Val Ala Asp Ser 245 250 255 Val Gln Met Lys Gly Ile Asn Gln Tyr Ile Ile Phe Tyr Lys Pro Lys 260 265 270 Pro Thr Ser Phe Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser 275 280 285 Leu Pro Ala Glu Met Pro Leu Ile Pro Val Asn Gln Ala Asp Thr Leu 290 295 300 Ile Val Asp Gly Glu Ala Val Ser Leu Thr Gly His Ser Gly Asn Ile 305 310 315 320 Phe Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys 325 330 335 Leu Pro Gly Gly Ser Tyr Ala Leu Arg Val Gln Gly Glu Pro Ser Lys 340 345 350 Gly Glu Met Leu Ala Gly Thr Ala Val Tyr Asn Gly Glu Val Leu His 355 360 365 Phe His Thr Glu Asn Gly Arg Pro Ser Pro Ser Arg Gly Arg Phe Ala 370 375 380 Ala Lys Val Asp Phe Gly Ser Lys Ser Val Asp Gly Ile Ile Asp Ser 385 390 395 400 Gly Asp Gly Leu His Met Gly Thr Gln Lys Phe Lys Ala Ala Ile Asp 405 410 415 Gly Asn Gly Phe 
Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val 420 425 430 Ser Gly Lys Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr 435 440 445 Ser Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala 450 455 460 Gly Lys Lys Glu Gln Asp Gly Ser Gly Gly Gly Gly Ala Thr Asn Asp 465 470 475 480 Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile Ala Ala Ala Tyr Asn 485 490 495 Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly Glu Thr Ile Tyr Asp 500 505 510 Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp Ala Thr Ala Ala Asp 515 520 525 Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu Lys Lys Val Val Thr 530 535 540 Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln Asn Val Asp Ala Lys 545 550 555 560 Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu Thr Thr Lys Leu Ala 565 570 575 Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr 580 585 590 Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile Thr Thr Phe Ala Glu 595 600 605 Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu Lys Leu Glu Ala Val 610 615 620 Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe Asn Asp Ile Ala Asp 625 630 635 640 Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu Ala Val Lys Thr Ala 645 650 655 Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys Gln Asn Val Asp Ala 660 665 670 Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala 675 680 685 Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu Ala Val Ala Ala Lys 690 695 700 Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn Lys Asp Asn Ile Ala 705 710 715 720 Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg Glu Glu Ser Asp Ser 725 730 735 Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr Thr Glu Lys Leu Asp 740 745 750 Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala Asp His Asp Thr Arg 755 760 765 Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu Arg Lys Glu Thr Arg 770 775 780 Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly Leu Phe Gln Pro Tyr 785 790 795 800 Asn Val Gly Arg Phe Asn Val Thr Ala Ala Val Gly Gly Tyr Lys Ser 805 810 815 Glu Ser Ala Val Ala Ile Gly Thr Gly Phe Arg Phe Thr Glu Asn Phe 820 825 830 Ala Ala Lys Ala Gly 
Val Ala Val Gly Thr Ser Ser Gly Ser Ser Ala 835 840 845 Ala Tyr His Val Gly Val Asn Tyr Glu Trp 850 855 15 1082 PRT Artificial Sequence 983 15 Met Arg Thr Thr Pro Thr Phe Pro Thr Lys Thr Phe Lys Pro Thr Ala 1 5 10 15 Met Ala Leu Ala Val Ala Thr Thr Leu Ser Ala Cys Leu Gly Gly Gly 20 25 30 Gly Gly Gly Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile 35 40 45 Gly Ser Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr 50 55 60 Ala Gly Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala 65 70 75 80 Gly Arg Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala 85 90 95 Pro Pro Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala 100 105 110 Tyr Lys Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr 115 120 125 Gly Arg Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly 130 135 140 Ser Ile Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn 145 150 155 160 Glu Asn Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu 165 170 175 Asp Gly Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val 180 185 190 Ile Glu Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile 195 200 205 Gly His Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp 210 215 220 Gly Arg Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met 225 230 235 240 Asn Thr Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg 245 250 255 Asn Ala Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn 260 265 270 Ser Phe Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile 275 280 285 Ala Asn Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly 290 295 300 Gly Asp Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr 305 310 315 320 Gly Asn Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe 325 330 335 Ser Thr Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu 340 345 350 Pro Phe Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly 355 360 365 Val Asp Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro 370 375 380 Gly Thr Glu Pro Leu Glu Tyr Gly 
Ser Asn His Cys Gly Ile Thr Ala 385 390 395 400 Met Trp Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg 405 410 415 Thr Asn Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val 420 425 430 Thr Gly Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn 435 440 445 Asp Asn Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala 450 455 460 Val Gly Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys 465 470 475 480 Ala Met Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp 485 490 495 Thr Lys Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser 500 505 510 Gly Thr Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His 515 520 525 Gly Asn Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu 530 535 540 Val Leu Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly 545 550 555 560 Ala Leu Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp 565 570 575 Gly Ile Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr 580 585 590 Val His Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr 595 600 605 Thr Arg Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly 610 615 620 Gly Lys Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn 625 630 635 640 Ser Thr Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln 645 650 655 Asp Tyr Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala 660 665 670 Ser Leu Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu 675 680 685 Ser Tyr Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala 690 695 700 Ala His Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly 705 710 715 720 Ser Asn Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser 725 730 735 Ala Thr Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met 740 745 750 Pro Gly Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val 755 760 765 Gln His Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala 770 775 780 Ala Thr Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly 785 790 795 800 Arg Arg Leu Lys Ala Val Ser Asp 
Gly Leu Asp His Asn Gly Thr Gly 805 810 815 Leu Arg Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln 820 825 830 Gly Gly Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile 835 840 845 Ala Ala Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met 850 855 860 Gly Arg Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser 865 870 875 880 Ile Ser Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr 885 890 895 Leu Lys Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg 900 905 910 Ser Thr Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu 915 920 925 Met Gln Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr 930 935 940 Gly Asp Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln 945 950 955 960 Asp Ala Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser 965 970 975 Leu Thr Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln 980 985 990 Pro Leu Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg 995 1000 1005 Asp Leu Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala 1010 1015 1020 Thr Ala Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg 1025 1030 1035 1040 Leu Val Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn 1045 1050 1055 Gly Leu Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His 1060 1065 1070 Ser Gly Arg Val Gly Val Gly Tyr Arg Phe 1075 1080 16 1047 PRT Artificial Sequence deltaG983 16 Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser Asn 1 5 10 15 Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly Ile 20 25 30 Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg Asp 35 40 45 Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala Pro Pro Pro 50 55 60 Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys Asn 65 70 75 80 Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg Gly 85 90 95 Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile Ser 100 105 110 Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn Tyr 115 120 125 Lys Asn Tyr Thr Ala Tyr 
Met Arg Lys Glu Ala Pro Glu Asp Gly Gly 130 135 140 Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val Ile Glu Thr 145 150 155 160 Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile Gly His Ile 165 170 175 Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg Pro 180 185 190 Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met Asn Thr Asn 195 200 205 Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala Trp 210 215 220 Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe Gly 225 230 235 240 Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn Ser 245 250 255 Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp Lys 260 265 270 Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn Leu 275 280 285 Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr Gly 290 295 300 Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe Tyr 305 310 315 320 Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly Val Asp Arg 325 330 335 Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr Glu 340 345 350 Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala Met Trp Cys 355 360 365 Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn Pro 370 375 380 Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly Thr 385 390 395 400 Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn Leu 405 410 415 Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly Val 420 425 430 Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met Asn 435 440 445 Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys Gly 450 455 460 Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly Thr Gly 465 470 475 480 Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn Asn 485 490 495 Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu Tyr 500 505 510 Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu Ile 515 520 525 Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile Val 530 535 540 Tyr Leu Ala Asp Thr Asp Gln 
Ser Gly Ala Asn Glu Thr Val His Ile 545 550 555 560 Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg Leu 565 570 575 Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly Gly Lys Leu 580 585 590 Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr Gly 595 600 605 Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln Asp Tyr Ser 610 615 620 Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu Asp 625 630 635 640 Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr Tyr 645 650 655 Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His Ser 660 665 670 Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn Leu 675 680 685 Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr Pro 690 695 700 Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly Ile 705 710 715 720 Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His Ala 725 730 735 Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr Val 740 745 750 Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg Leu 755 760 765 Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg Val 770 775 780 Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly Val 785 790 795 800 Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala Lys 805 810 815 Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg Ser 820 825 830 Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser Leu 835 840 845 Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys Gly 850 855 860 Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr Gly 865 870 875 880 Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gln Leu 885 890 895 Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp Leu 900 905 910 Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala Phe 915 920 925 Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr Glu 930 935 940 Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu Ser 945 950 955 960 Asp Lys Ala Val Leu Phe Ala 
Thr Ala Gly Val Glu Arg Asp Leu Asn 965 970 975 Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala Ala 980 985 990 Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val Ala 995 1000 1005 Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu Ala 1010 1015 1020 Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly Arg 1025 1030 1035 1040 Val Gly Val Gly Tyr Arg Phe 1045 17 4425 DNA Artificial Sequence deltaG983-ORF46.1 17 atgacttctg cgcccgactt caatgcaggc ggtaccggta tcggcagcaa cagcagagca 60 acaacagcga aatcagcagc agtatcttac gccggtatca agaacgaaat gtgcaaagac 120 agaagcatgc tctgtgccgg tcgggatgac gttgcggtta cagacaggga tgccaaaatc 180 aatgcccccc ccccgaatct gcataccgga gactttccaa acccaaatga cgcatacaag 240 aatttgatca acctcaaacc tgcaattgaa gcaggctata caggacgcgg ggtagaggta 300 ggtatcgtcg acacaggcga atccgtcggc agcatatcct ttcccgaact gtatggcaga 360 aaagaacacg gctataacga aaattacaaa aactatacgg cgtatatgcg gaaggaagcg 420 cctgaagacg gaggcggtaa agacattgaa gcttctttcg acgatgaggc cgttatagag 480 actgaagcaa agccgacgga tatccgccac gtaaaagaaa tcggacacat cgatttggtc 540 tcccatatta ttggcgggcg ttccgtggac ggcagacctg caggcggtat tgcgcccgat 600 gcgacgctac acataatgaa tacgaatgat gaaaccaaga acgaaatgat ggttgcagcc 660 atccgcaatg catgggtcaa gctgggcgaa cgtggcgtgc gcatcgtcaa taacagtttt 720 ggaacaacat cgagggcagg cactgccgac cttttccaaa tagccaattc ggaggagcag 780 taccgccaag cgttgctcga ctattccggc ggtgataaaa cagacgaggg tatccgcctg 840 atgcaacaga gcgattacgg caacctgtcc taccacatcc gtaataaaaa catgcttttc 900 atcttttcga caggcaatga cgcacaagct cagcccaaca catatgccct attgccattt 960 tatgaaaaag acgctcaaaa aggcattatc acagtcgcag gcgtagaccg cagtggagaa 1020 aagttcaaac gggaaatgta tggagaaccg ggtacagaac cgcttgagta tggctccaac 1080 cattgcggaa ttactgccat gtggtgcctg tcggcaccct atgaagcaag cgtccgtttc 1140 acccgtacaa acccgattca aattgccgga acatcctttt ccgcacccat cgtaaccggc 1200 acggcggctc tgctgctgca gaaatacccg tggatgagca acgacaacct gcgtaccacg 1260 ttgctgacga cggctcagga catcggtgca gtcggcgtgg acagcaagtt cggctgggga 1320 
ctgctggatg cgggtaaggc catgaacgga cccgcgtcct ttccgttcgg cgactttacc 1380 gccgatacga aaggtacatc cgatattgcc tactccttcc gtaacgacat ttcaggcacg 1440 ggcggcctga tcaaaaaagg cggcagccaa ctgcaactgc acggcaacaa cacctatacg 1500 ggcaaaacca ttatcgaagg cggttcgctg gtgttgtacg gcaacaacaa atcggatatg 1560 cgcgtcgaaa ccaaaggtgc gctgatttat aacggggcgg catccggcgg cagcctgaac 1620 agcgacggca ttgtctatct ggcagatacc gaccaatccg gcgcaaacga aaccgtacac 1680 atcaaaggca gtctgcagct ggacggcaaa ggtacgctgt acacacgttt gggcaaactg 1740 ctgaaagtgg acggtacggc gattatcggc ggcaagctgt acatgtcggc acgcggcaag 1800 ggggcaggct atctcaacag taccggacga cgtgttccct tcctgagtgc cgccaaaatc 1860 gggcaggatt attctttctt cacaaacatc gaaaccgacg gcggcctgct ggcttccctc 1920 gacagcgtcg aaaaaacagc gggcagtgaa ggcgacacgc tgtcctatta tgtccgtcgc 1980 ggcaatgcgg cacggactgc ttcggcagcg gcacattccg cgcccgccgg tctgaaacac 2040 gccgtagaac agggcggcag caatctggaa aacctgatgg tcgaactgga tgcctccgaa 2100 tcatccgcaa cacccgagac ggttgaaact gcggcagccg accgcacaga tatgccgggc 2160 atccgcccct acggcgcaac tttccgcgca gcggcagccg tacagcatgc gaatgccgcc 2220 gacggtgtac gcatcttcaa cagtctcgcc gctaccgtct atgccgacag taccgccgcc 2280 catgccgata tgcagggacg ccgcctgaaa gccgtatcgg acgggttgga ccacaacggc 2340 acgggtctgc gcgtcatcgc gcaaacccaa caggacggtg gaacgtggga acagggcggt 2400 gttgaaggca aaatgcgcgg cagtacccaa accgtcggca ttgccgcgaa aaccggcgaa 2460 aatacgacag cagccgccac actgggcatg ggacgcagca catggagcga aaacagtgca 2520 aatgcaaaaa ccgacagcat tagtctgttt gcaggcatac ggcacgatgc gggcgatatc 2580 ggctatctca aaggcctgtt ctcctacgga cgctacaaaa acagcatcag ccgcagcacc 2640 ggtgcggacg aacatgcgga aggcagcgtc aacggcacgc tgatgcagct gggcgcactg 2700 ggcggtgtca acgttccgtt tgccgcaacg ggagatttga cggtcgaagg cggtctgcgc 2760 tacgacctgc tcaaacagga tgcattcgcc gaaaaaggca gtgctttggg ctggagcggc 2820 aacagcctca ctgaaggcac gctggtcgga ctcgcgggtc tgaagctgtc gcaacccttg 2880 agcgataaag ccgtcctgtt tgcaacggcg ggcgtggaac gcgacctgaa cggacgcgac 2940 tacacggtaa cgggcggctt taccggcgcg actgcagcaa ccggcaagac gggggcacgc 3000 aatatgccgc 
acacccgtct ggttgccggc ctgggcgcgg atgtcgaatt cggcaacggc 3060 tggaacggct tggcacgtta cagctacgcc ggttccaaac agtacggcaa ccacagcgga 3120 cgagtcggcg taggctaccg gttcctcgac ggtggcggag gcactggatc ctcagatttg 3180 gcaaacgatt cttttatccg gcaggttctc gaccgtcagc atttcgaacc cgacgggaaa 3240 taccacctat tcggcagcag gggggaactt gccgagcgca gcggccatat cggattggga 3300 aaaatacaaa gccatcagtt gggcaacctg atgattcaac aggcggccat taaaggaaat 3360 atcggctaca ttgtccgctt ttccgatcac gggcacgaag tccattcccc cttcgacaac 3420 catgcctcac attccgattc tgatgaagcc ggtagtcccg ttgacggatt tagcctttac 3480 cgcatccatt gggacggata cgaacaccat cccgccgacg gctatgacgg gccacagggc 3540 ggcggctatc ccgctcccaa aggcgcgagg gatatataca gctacgacat aaaaggcgtt 3600 gcccaaaata tccgcctcaa cctgaccgac aaccgcagca ccggacaacg gcttgccgac 3660 cgtttccaca atgccggtag tatgctgacg caaggagtag gcgacggatt caaacgcgcc 3720 acccgataca gccccgagct ggacagatcg ggcaatgccg ccgaagcctt caacggcact 3780 gcagatatcg ttaaaaacat catcggcgcg gcaggagaaa ttgtcggcgc aggcgatgcc 3840 gtgcagggca taagcgaagg ctcaaacatt gctgtcatgc acggcttggg tctgctttcc 3900 accgaaaaca agatggcgcg catcaacgat ttggcagata tggcgcaact caaagactat 3960 gccgcagcag ccatccgcga ttgggcagtc caaaacccca atgccgcaca aggcatagaa 4020 gccgtcagca atatctttat ggcagccatc cccatcaaag ggattggagc tgttcgggga 4080 aaatacggct tgggcggcat cacggcacat cctatcaagc ggtcgcagat gggcgcgatc 4140 gcattgccga aagggaaatc cgccgtcagc gacaattttg ccgatgcggc atacgccaaa 4200 tacccgtccc cttaccattc ccgaaatatc cgttcaaact tggagcagcg ttacggcaaa 4260 gaaaacatca cctcctcaac cgtgccgccg tcaaacggca aaaatgtcaa actggcagac 4320 caacgccacc cgaagacagg cgtaccgttt gacggtaaag ggtttccgaa ttttgagaag 4380 cacgtgaaat atgatacgct cgagcaccac caccaccacc actga 4425 18 1474 PRT Artificial Sequence deltaG983-ORF46.1 18 Met Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser 1 5 10 15 Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly 20 25 30 Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg 35 40 45 Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile 
Asn Ala Pro Pro 50 55 60 Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys 65 70 75 80 Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg 85 90 95 Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile 100 105 110 Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn 115 120 125 Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly 130 135 140 Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val Ile Glu 145 150 155 160 Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile Gly His 165 170 175 Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg 180 185 190 Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met Asn Thr 195 200 205 Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala 210 215 220 Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe 225 230 235 240 Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn 245 250 255 Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp 260 265 270 Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn 275 280 285 Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr 290 295 300 Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe 305 310 315 320 Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly Val Asp 325 330 335 Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr 340 345 350 Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala Met Trp 355 360 365 Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn 370 375 380 Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly 385 390 395 400 Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn 405 410 415 Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly 420 425 430 Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met 435 440 445 Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys 450 455 460 Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly 
Thr 465 470 475 480 Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn 485 490 495 Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu 500 505 510 Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu 515 520 525 Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile 530 535 540 Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr Val His 545 550 555 560 Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg 565 570 575 Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly Gly Lys 580 585 590 Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr 595 600 605 Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln Asp Tyr 610 615 620 Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu 625 630 635 640 Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr 645 650 655 Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His 660 665 670 Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn 675 680 685 Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr 690 695 700 Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly 705 710 715 720 Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His 725 730 735 Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr 740 745 750 Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg 755 760 765 Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg 770 775 780 Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly 785 790 795 800 Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala 805 810 815 Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg 820 825 830 Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser 835 840 845 Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys 850 855 860 Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr 865 870 875 880 Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met 
Gln 885 890 895 Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp 900 905 910 Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala 915 920 925 Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr 930 935 940 Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu 945 950 955 960 Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg Asp Leu 965 970 975 Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala 980 985 990 Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val 995 1000 1005 Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu 1010 1015 1020 Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly 1025 1030 1035 1040 Arg Val Gly Val Gly Tyr Arg Phe Leu Asp Gly Gly Gly Gly Thr Gly 1045 1050 1055 Ser Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln Val Leu Asp Arg 1060 1065 1070 Gln His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly Ser Arg Gly 1075 1080 1085 Glu Leu Ala Glu Arg Ser Gly His Ile Gly Leu Gly Lys Ile Gln Ser 1090 1095 1100 His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala Ile Lys Gly Asn 1105 1110 1115 1120 Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His Glu Val His Ser 1125 1130 1135 Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly Ser 1140 1145 1150 Pro Val Asp Gly Phe Ser Leu Tyr Arg Ile His Trp Asp Gly Tyr Glu 1155 1160 1165 His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly Gly Tyr Pro 1170 1175 1180 Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile Lys Gly Val 1185 1190 1195 1200 Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly Gln 1205 1210 1215 Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gln Gly 1220 1225 1230 Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro Glu Leu Asp 1235 1240 1245 Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp Ile Val 1250 1255 1260 Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly Ala Gly Asp Ala 1265 1270 1275 1280 Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val Met His Gly Leu 1285 1290 1295 Gly 
Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile Asn Asp Leu Ala 1300 1305 1310 Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile Arg Asp Trp 1315 1320 1325 Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala Val Ser Asn 1330 1335 1340 Ile Phe Met Ala Ala Ile Pro Ile Lys Gly Ile Gly Ala Val Arg Gly 1345 1350 1355 1360 Lys Tyr Gly Leu Gly Gly Ile Thr Ala His Pro Ile Lys Arg Ser Gln 1365 1370 1375 Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp Asn 1380 1385 1390 Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser Arg 1395 1400 1405 Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu Asn Ile Thr 1410 1415 1420 Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val Lys Leu Ala Asp 1425 1430 1435 1440 Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe Pro 1445 1450 1455 Asn Phe Glu Lys His Val Lys Tyr Asp Thr Leu Glu His His His His 1460 1465 1470 His His 19 3939 DNA Artificial Sequence deltaG983-741 19 atgacttctg cgcccgactt caatgcaggc ggtaccggta tcggcagcaa cagcagagca 60 acaacagcga aatcagcagc agtatcttac gccggtatca agaacgaaat gtgcaaagac 120 agaagcatgc tctgtgccgg tcgggatgac gttgcggtta cagacaggga tgccaaaatc 180 aatgcccccc ccccgaatct gcataccgga gactttccaa acccaaatga cgcatacaag 240 aatttgatca acctcaaacc tgcaattgaa gcaggctata caggacgcgg ggtagaggta 300 ggtatcgtcg acacaggcga atccgtcggc agcatatcct ttcccgaact gtatggcaga 360 aaagaacacg gctataacga aaattacaaa aactatacgg cgtatatgcg gaaggaagcg 420 cctgaagacg gaggcggtaa agacattgaa gcttctttcg acgatgaggc cgttatagag 480 actgaagcaa agccgacgga tatccgccac gtaaaagaaa tcggacacat cgatttggtc 540 tcccatatta ttggcgggcg ttccgtggac ggcagacctg caggcggtat tgcgcccgat 600 gcgacgctac acataatgaa tacgaatgat gaaaccaaga acgaaatgat ggttgcagcc 660 atccgcaatg catgggtcaa gctgggcgaa cgtggcgtgc gcatcgtcaa taacagtttt 720 ggaacaacat cgagggcagg cactgccgac cttttccaaa tagccaattc ggaggagcag 780 taccgccaag cgttgctcga ctattccggc ggtgataaaa cagacgaggg tatccgcctg 840 atgcaacaga gcgattacgg caacctgtcc taccacatcc gtaataaaaa catgcttttc 900 atcttttcga 
caggcaatga cgcacaagct cagcccaaca catatgccct attgccattt 960 tatgaaaaag acgctcaaaa aggcattatc acagtcgcag gcgtagaccg cagtggagaa 1020 aagttcaaac gggaaatgta tggagaaccg ggtacagaac cgcttgagta tggctccaac 1080 cattgcggaa ttactgccat gtggtgcctg tcggcaccct atgaagcaag cgtccgtttc 1140 acccgtacaa acccgattca aattgccgga acatcctttt ccgcacccat cgtaaccggc 1200 acggcggctc tgctgctgca gaaatacccg tggatgagca acgacaacct gcgtaccacg 1260 ttgctgacga cggctcagga catcggtgca gtcggcgtgg acagcaagtt cggctgggga 1320 ctgctggatg cgggtaaggc catgaacgga cccgcgtcct ttccgttcgg cgactttacc 1380 gccgatacga aaggtacatc cgatattgcc tactccttcc gtaacgacat ttcaggcacg 1440 ggcggcctga tcaaaaaagg cggcagccaa ctgcaactgc acggcaacaa cacctatacg 1500 ggcaaaacca ttatcgaagg cggttcgctg gtgttgtacg gcaacaacaa atcggatatg 1560 cgcgtcgaaa ccaaaggtgc gctgatttat aacggggcgg catccggcgg cagcctgaac 1620 agcgacggca ttgtctatct ggcagatacc gaccaatccg gcgcaaacga aaccgtacac 1680 atcaaaggca gtctgcagct ggacggcaaa ggtacgctgt acacacgttt gggcaaactg 1740 ctgaaagtgg acggtacggc gattatcggc ggcaagctgt acatgtcggc acgcggcaag 1800 ggggcaggct atctcaacag taccggacga cgtgttccct tcctgagtgc cgccaaaatc 1860 gggcaggatt attctttctt cacaaacatc gaaaccgacg gcggcctgct ggcttccctc 1920 gacagcgtcg aaaaaacagc gggcagtgaa ggcgacacgc tgtcctatta tgtccgtcgc 1980 ggcaatgcgg cacggactgc ttcggcagcg gcacattccg cgcccgccgg tctgaaacac 2040 gccgtagaac agggcggcag caatctggaa aacctgatgg tcgaactgga tgcctccgaa 2100 tcatccgcaa cacccgagac ggttgaaact gcggcagccg accgcacaga tatgccgggc 2160 atccgcccct acggcgcaac tttccgcgca gcggcagccg tacagcatgc gaatgccgcc 2220 gacggtgtac gcatcttcaa cagtctcgcc gctaccgtct atgccgacag taccgccgcc 2280 catgccgata tgcagggacg ccgcctgaaa gccgtatcgg acgggttgga ccacaacggc 2340 acgggtctgc gcgtcatcgc gcaaacccaa caggacggtg gaacgtggga acagggcggt 2400 gttgaaggca aaatgcgcgg cagtacccaa accgtcggca ttgccgcgaa aaccggcgaa 2460 aatacgacag cagccgccac actgggcatg ggacgcagca catggagcga aaacagtgca 2520 aatgcaaaaa ccgacagcat tagtctgttt gcaggcatac ggcacgatgc gggcgatatc 2580 ggctatctca aaggcctgtt 
ctcctacgga cgctacaaaa acagcatcag ccgcagcacc 2640 ggtgcggacg aacatgcgga aggcagcgtc aacggcacgc tgatgcagct gggcgcactg 2700 ggcggtgtca acgttccgtt tgccgcaacg ggagatttga cggtcgaagg cggtctgcgc 2760 tacgacctgc tcaaacagga tgcattcgcc gaaaaaggca gtgctttggg ctggagcggc 2820 aacagcctca ctgaaggcac gctggtcgga ctcgcgggtc tgaagctgtc gcaacccttg 2880 agcgataaag ccgtcctgtt tgcaacggcg ggcgtggaac gcgacctgaa cggacgcgac 2940 tacacggtaa cgggcggctt taccggcgcg actgcagcaa ccggcaagac gggggcacgc 3000 aatatgccgc acacccgtct ggttgccggc ctgggcgcgg atgtcgaatt cggcaacggc 3060 tggaacggct tggcacgtta cagctacgcc ggttccaaac agtacggcaa ccacagcgga 3120 cgagtcggcg taggctaccg gttcctcgag ggatccggag ggggtggtgt cgccgccgac 3180 atcggtgcgg ggcttgccga tgcactaacc gcaccgctcg accataaaga caaaggtttg 3240 cagtctttga cgctggatca gtccgtcagg aaaaacgaga aactgaagct ggcggcacaa 3300 ggtgcggaaa aaacttatgg aaacggtgac agcctcaata cgggcaaatt gaagaacgac 3360 aaggtcagcc gtttcgactt tatccgccaa atcgaagtgg acgggcagct cattaccttg 3420 gagagtggag agttccaagt atacaaacaa agccattccg ccttaaccgc ctttcagacc 3480 gagcaaatac aagattcgga gcattccggg aagatggttg cgaaacgcca gttcagaatc 3540 ggcgacatag cgggcgaaca tacatctttt gacaagcttc ccgaaggcgg cagggcgaca 3600 tatcgcggga cggcgttcgg ttcagacgat gccggcggaa aactgaccta caccatagat 3660 ttcgccgcca agcagggaaa cggcaaaatc gaacatttga aatcgccaga actcaatgtc 3720 gacctggccg ccgccgatat caagccggat ggaaaacgcc atgccgtcat cagcggttcc 3780 gtcctttaca accaagccga gaaaggcagt tactccctcg gtatctttgg cggaaaagcc 3840 caggaagttg ccggcagcgc ggaagtgaaa accgtaaacg gcatacgcca tatcggcctt 3900 gccgccaagc aactcgagca ccaccaccac caccactga 3939 20 1312 PRT Artificial Sequence deltaG983-741 20 Met Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser 1 5 10 15 Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly 20 25 30 Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg 35 40 45 Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala Pro Pro 50 55 60 Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys 65 70 
75 80 Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg 85 90 95 Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile 100 105 110 Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn 115 120 125 Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly 130 135 140 Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val Ile Glu 145 150 155 160 Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile Gly His 165 170 175 Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg 180 185 190 Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met Asn Thr 195 200 205 Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala 210 215 220 Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe 225 230 235 240 Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn 245 250 255 Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp 260 265 270 Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn 275 280 285 Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr 290 295 300 Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe 305 310 315 320 Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly Val Asp 325 330 335 Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr 340 345 350 Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala Met Trp 355 360 365 Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn 370 375 380 Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly 385 390 395 400 Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn 405 410 415 Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly 420 425 430 Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met 435 440 445 Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys 450 455 460 Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly Thr 465 470 475 480 Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn 485 490 495 
Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu 500 505 510 Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu 515 520 525 Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile 530 535 540 Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr Val His 545 550 555 560 Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg 565 570 575 Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly Gly Lys 580 585 590 Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr 595 600 605 Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln Asp Tyr 610 615 620 Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu 625 630 635 640 Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr 645 650 655 Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His 660 665 670 Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn 675 680 685 Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr 690 695 700 Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly 705 710 715 720 Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His 725 730 735 Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr 740 745 750 Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg 755 760 765 Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg 770 775 780 Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly 785 790 795 800 Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala 805 810 815 Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg 820 825 830 Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser 835 840 845 Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys 850 855 860 Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr 865 870 875 880 Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gln 885 890 895 Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp 900 905 910 Leu 
Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala 915 920 925 Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr 930 935 940 Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu 945 950 955 960 Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg Asp Leu 965 970 975 Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala 980 985 990 Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val 995 1000 1005 Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu 1010 1015 1020 Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly 1025 1030 1035 1040 Arg Val Gly Val Gly Tyr Arg Phe Leu Glu Gly Ser Gly Gly Gly Gly 1045 1050 1055 Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala Pro 1060 1065 1070 Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln Ser 1075 1080 1085 Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu Lys 1090 1095 1100 Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn Asp 1105 1110 1115 1120 Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly Gln 1125 1130 1135 Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser His 1140 1145 1150 Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu His 1155 1160 1165 Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile Ala 1170 1175 1180 Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr 1185 1190 1195 1200 Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr 1205 1210 1215 Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu His 1220 1225 1230 Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile Lys 1235 1240 1245 Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr Asn 1250 1255 1260 Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys Ala 1265 1270 1275 1280 Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile Arg 1285 1290 1295 His Ile Gly Leu Ala Ala Lys Gln Leu Glu His His His His His His 1300 1305 1310 21 4344 DNA 
Artificial Sequence deltaG983-961 21 atgacttctg cgcccgactt caatgcaggc ggtaccggta tcggcagcaa cagcagagca 60 acaacagcga aatcagcagc agtatcttac gccggtatca agaacgaaat gtgcaaagac 120 agaagcatgc tctgtgccgg tcgggatgac gttgcggtta cagacaggga tgccaaaatc 180 aatgcccccc ccccgaatct gcataccgga gactttccaa acccaaatga cgcatacaag 240 aatttgatca acctcaaacc tgcaattgaa gcaggctata caggacgcgg ggtagaggta 300 ggtatcgtcg acacaggcga atccgtcggc agcatatcct ttcccgaact gtatggcaga 360 aaagaacacg gctataacga aaattacaaa aactatacgg cgtatatgcg gaaggaagcg 420 cctgaagacg gaggcggtaa agacattgaa gcttctttcg acgatgaggc cgttatagag 480 actgaagcaa agccgacgga tatccgccac gtaaaagaaa tcggacacat cgatttggtc 540 tcccatatta ttggcgggcg ttccgtggac ggcagacctg caggcggtat tgcgcccgat 600 gcgacgctac acataatgaa tacgaatgat gaaaccaaga acgaaatgat ggttgcagcc 660 atccgcaatg catgggtcaa gctgggcgaa cgtggcgtgc gcatcgtcaa taacagtttt 720 ggaacaacat cgagggcagg cactgccgac cttttccaaa tagccaattc ggaggagcag 780 taccgccaag cgttgctcga ctattccggc ggtgataaaa cagacgaggg tatccgcctg 840 atgcaacaga gcgattacgg caacctgtcc taccacatcc gtaataaaaa catgcttttc 900 atcttttcga caggcaatga cgcacaagct cagcccaaca catatgccct attgccattt 960 tatgaaaaag acgctcaaaa aggcattatc acagtcgcag gcgtagaccg cagtggagaa 1020 aagttcaaac gggaaatgta tggagaaccg ggtacagaac cgcttgagta tggctccaac 1080 cattgcggaa ttactgccat gtggtgcctg tcggcaccct atgaagcaag cgtccgtttc 1140 acccgtacaa acccgattca aattgccgga acatcctttt ccgcacccat cgtaaccggc 1200 acggcggctc tgctgctgca gaaatacccg tggatgagca acgacaacct gcgtaccacg 1260 ttgctgacga cggctcagga catcggtgca gtcggcgtgg acagcaagtt cggctgggga 1320 ctgctggatg cgggtaaggc catgaacgga cccgcgtcct ttccgttcgg cgactttacc 1380 gccgatacga aaggtacatc cgatattgcc tactccttcc gtaacgacat ttcaggcacg 1440 ggcggcctga tcaaaaaagg cggcagccaa ctgcaactgc acggcaacaa cacctatacg 1500 ggcaaaacca ttatcgaagg cggttcgctg gtgttgtacg gcaacaacaa atcggatatg 1560 cgcgtcgaaa ccaaaggtgc gctgatttat aacggggcgg catccggcgg cagcctgaac 1620 agcgacggca ttgtctatct ggcagatacc gaccaatccg gcgcaaacga 
aaccgtacac 1680 atcaaaggca gtctgcagct ggacggcaaa ggtacgctgt acacacgttt gggcaaactg 1740 ctgaaagtgg acggtacggc gattatcggc ggcaagctgt acatgtcggc acgcggcaag 1800 ggggcaggct atctcaacag taccggacga cgtgttccct tcctgagtgc cgccaaaatc 1860 gggcaggatt attctttctt cacaaacatc gaaaccgacg gcggcctgct ggcttccctc 1920 gacagcgtcg aaaaaacagc gggcagtgaa ggcgacacgc tgtcctatta tgtccgtcgc 1980 ggcaatgcgg cacggactgc ttcggcagcg gcacattccg cgcccgccgg tctgaaacac 2040 gccgtagaac agggcggcag caatctggaa aacctgatgg tcgaactgga tgcctccgaa 2100 tcatccgcaa cacccgagac ggttgaaact gcggcagccg accgcacaga tatgccgggc 2160 atccgcccct acggcgcaac tttccgcgca gcggcagccg tacagcatgc gaatgccgcc 2220 gacggtgtac gcatcttcaa cagtctcgcc gctaccgtct atgccgacag taccgccgcc 2280 catgccgata tgcagggacg ccgcctgaaa gccgtatcgg acgggttgga ccacaacggc 2340 acgggtctgc gcgtcatcgc gcaaacccaa caggacggtg gaacgtggga acagggcggt 2400 gttgaaggca aaatgcgcgg cagtacccaa accgtcggca ttgccgcgaa aaccggcgaa 2460 aatacgacag cagccgccac actgggcatg ggacgcagca catggagcga aaacagtgca 2520 aatgcaaaaa ccgacagcat tagtctgttt gcaggcatac ggcacgatgc gggcgatatc 2580 ggctatctca aaggcctgtt ctcctacgga cgctacaaaa acagcatcag ccgcagcacc 2640 ggtgcggacg aacatgcgga aggcagcgtc aacggcacgc tgatgcagct gggcgcactg 2700 ggcggtgtca acgttccgtt tgccgcaacg ggagatttga cggtcgaagg cggtctgcgc 2760 tacgacctgc tcaaacagga tgcattcgcc gaaaaaggca gtgctttggg ctggagcggc 2820 aacagcctca ctgaaggcac gctggtcgga ctcgcgggtc tgaagctgtc gcaacccttg 2880 agcgataaag ccgtcctgtt tgcaacggcg ggcgtggaac gcgacctgaa cggacgcgac 2940 tacacggtaa cgggcggctt taccggcgcg actgcagcaa ccggcaagac gggggcacgc 3000 aatatgccgc acacccgtct ggttgccggc ctgggcgcgg atgtcgaatt cggcaacggc 3060 tggaacggct tggcacgtta cagctacgcc ggttccaaac agtacggcaa ccacagcgga 3120 cgagtcggcg taggctaccg gttcctcgag ggtggcggag gcactggatc cgccacaaac 3180 gacgacgatg ttaaaaaagc tgccactgtg gccattgctg ctgcctacaa caatggccaa 3240 gaaatcaacg gtttcaaagc tggagagacc atctacgaca ttgatgaaga cggcacaatt 3300 accaaaaaag acgcaactgc agccgatgtt gaagccgacg actttaaagg tctgggtctg 
3360 aaaaaagtcg tgactaacct gaccaaaacc gtcaatgaaa acaaacaaaa cgtcgatgcc 3420 aaagtaaaag ctgcagaatc tgaaatagaa aagttaacaa ccaagttagc agacactgat 3480 gccgctttag cagatactga tgccgctctg gatgcaacca ccaacgcctt gaataaattg 3540 ggagaaaata taacgacatt tgctgaagag actaagacaa atatcgtaaa aattgatgaa 3600 aaattagaag ccgtggctga taccgtcgac aagcatgccg aagcattcaa cgatatcgcc 3660 gattcattgg atgaaaccaa cactaaggca gacgaagccg tcaaaaccgc caatgaagcc 3720 aaacagacgg ccgaagaaac caaacaaaac gtcgatgcca aagtaaaagc tgcagaaact 3780 gcagcaggca aagccgaagc tgccgctggc acagctaata ctgcagccga caaggccgaa 3840 gctgtcgctg caaaagttac cgacatcaaa gctgatatcg ctacgaacaa agataatatt 3900 gctaaaaaag caaacagtgc cgacgtgtac accagagaag agtctgacag caaatttgtc 3960 agaattgatg gtctgaacgc tactaccgaa aaattggaca cacgcttggc ttctgctgaa 4020 aaatccattg ccgatcacga tactcgcctg aacggtttgg ataaaacagt gtcagacctg 4080 cgcaaagaaa cccgccaagg ccttgcagaa caagccgcgc tctccggtct gttccaacct 4140 tacaacgtgg gtcggttcaa tgtaacggct gcagtcggcg gctacaaatc cgaatcggca 4200 gtcgccatcg gtaccggctt ccgctttacc gaaaactttg ccgccaaagc aggcgtggca 4260 gtcggcactt cgtccggttc ttccgcagcc taccatgtcg gcgtcaatta cgagtggctc 4320 gagcaccacc accaccacca ctga 4344 22 1447 PRT Artificial Sequence deltaG983-961 22 Met Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser 1 5 10 15 Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly 20 25 30 Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg 35 40 45 Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala Pro Pro 50 55 60 Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys 65 70 75 80 Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg 85 90 95 Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile 100 105 110 Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn 115 120 125 Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly 130 135 140 Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val Ile Glu 145 150 155 160 Thr Glu Ala Lys Pro Thr Asp 
Ile Arg His Val Lys Glu Ile Gly His 165 170 175 Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg 180 185 190 Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met Asn Thr 195 200 205 Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala 210 215 220 Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe 225 230 235 240 Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn 245 250 255 Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp 260 265 270 Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn 275 280 285 Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr 290 295 300 Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe 305 310 315 320 Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly Val Asp 325 330 335 Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr 340 345 350 Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala Met Trp 355 360 365 Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn 370 375 380 Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly 385 390 395 400 Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn 405 410 415 Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly 420 425 430 Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met 435 440 445 Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys 450 455 460 Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly Thr 465 470 475 480 Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn 485 490 495 Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu 500 505 510 Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu 515 520 525 Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile 530 535 540 Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr Val His 545 550 555 560 Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg 565 570 575 Leu Gly Lys Leu Leu Lys Val Asp 
Gly Thr Ala Ile Ile Gly Gly Lys 580 585 590 Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr 595 600 605 Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln Asp Tyr 610 615 620 Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu 625 630 635 640 Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr 645 650 655 Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His 660 665 670 Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn 675 680 685 Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr 690 695 700 Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly 705 710 715 720 Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His 725 730 735 Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr 740 745 750 Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg 755 760 765 Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg 770 775 780 Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly 785 790 795 800 Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala 805 810 815 Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg 820 825 830 Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser 835 840 845 Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys 850 855 860 Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr 865 870 875 880 Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gln 885 890 895 Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp 900 905 910 Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala 915 920 925 Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr 930 935 940 Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu 945 950 955 960 Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg Asp Leu 965 970 975 Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala 980 985 990 Ala Thr Gly Lys Thr Gly Ala Arg Asn 
Met Pro His Thr Arg Leu Val 995 1000 1005 Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu 1010 1015 1020 Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly 1025 1030 1035 1040 Arg Val Gly Val Gly Tyr Arg Phe Leu Glu Gly Gly Gly Gly Thr Gly 1045 1050 1055 Ser Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile 1060 1065 1070 Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly 1075 1080 1085 Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp 1090 1095 1100 Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu 1105 1110 1115 1120 Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln 1125 1130 1135 Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu 1140 1145 1150 Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 1155 1160 1165 Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile 1170 1175 1180 Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu 1185 1190 1195 1200 Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe 1205 1210 1215 Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 1220 1225 1230 Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys 1235 1240 1245 Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys 1250 1255 1260 Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 1265 1270 1275 1280 Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn 1285 1290 1295 Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg 1300 1305 1310 Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr 1315 1320 1325 Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala 1330 1335 1340 Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu 1345 1350 1355 1360 Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly 1365 1370 1375 Leu Phe Gln Pro Tyr Asn Val Gly Arg Phe Asn Val Thr Ala Ala Val 1380 1385 1390 Gly Gly Tyr Lys Ser Glu Ser Ala Val 
Ala Ile Gly Thr Gly Phe Arg 1395 1400 1405 Phe Thr Glu Asn Phe Ala Ala Lys Ala Gly Val Ala Val Gly Thr Ser 1410 1415 1420 Ser Gly Ser Ser Ala Ala Tyr His Val Gly Val Asn Tyr Glu Trp Leu 1425 1430 1435 1440 Glu His His His His His His 1445 23 4179 DNA Artificial Sequence deltaG983-961c 23 atgacttctg cgcccgactt caatgcaggc ggtaccggta tcggcagcaa cagcagagca 60 acaacagcga aatcagcagc agtatcttac gccggtatca agaacgaaat gtgcaaagac 120 agaagcatgc tctgtgccgg tcgggatgac gttgcggtta cagacaggga tgccaaaatc 180 aatgcccccc ccccgaatct gcataccgga gactttccaa acccaaatga cgcatacaag 240 aatttgatca acctcaaacc tgcaattgaa gcaggctata caggacgcgg ggtagaggta 300 ggtatcgtcg acacaggcga atccgtcggc agcatatcct ttcccgaact gtatggcaga 360 aaagaacacg gctataacga aaattacaaa aactatacgg cgtatatgcg gaaggaagcg 420 cctgaagacg gaggcggtaa agacattgaa gcttctttcg acgatgaggc cgttatagag 480 actgaagcaa agccgacgga tatccgccac gtaaaagaaa tcggacacat cgatttggtc 540 tcccatatta ttggcgggcg ttccgtggac ggcagacctg caggcggtat tgcgcccgat 600 gcgacgctac acataatgaa tacgaatgat gaaaccaaga acgaaatgat ggttgcagcc 660 atccgcaatg catgggtcaa gctgggcgaa cgtggcgtgc gcatcgtcaa taacagtttt 720 ggaacaacat cgagggcagg cactgccgac cttttccaaa tagccaattc ggaggagcag 780 taccgccaag cgttgctcga ctattccggc ggtgataaaa cagacgaggg tatccgcctg 840 atgcaacaga gcgattacgg caacctgtcc taccacatcc gtaataaaaa catgcttttc 900 atcttttcga caggcaatga cgcacaagct cagcccaaca catatgccct attgccattt 960 tatgaaaaag acgctcaaaa aggcattatc acagtcgcag gcgtagaccg cagtggagaa 1020 aagttcaaac gggaaatgta tggagaaccg ggtacagaac cgcttgagta tggctccaac 1080 cattgcggaa ttactgccat gtggtgcctg tcggcaccct atgaagcaag cgtccgtttc 1140 acccgtacaa acccgattca aattgccgga acatcctttt ccgcacccat cgtaaccggc 1200 acggcggctc tgctgctgca gaaatacccg tggatgagca acgacaacct gcgtaccacg 1260 ttgctgacga cggctcagga catcggtgca gtcggcgtgg acagcaagtt cggctgggga 1320 ctgctggatg cgggtaaggc catgaacgga cccgcgtcct ttccgttcgg cgactttacc 1380 gccgatacga aaggtacatc cgatattgcc tactccttcc gtaacgacat ttcaggcacg 1440 ggcggcctga tcaaaaaagg 
cggcagccaa ctgcaactgc acggcaacaa cacctatacg 1500 ggcaaaacca ttatcgaagg cggttcgctg gtgttgtacg gcaacaacaa atcggatatg 1560 cgcgtcgaaa ccaaaggtgc gctgatttat aacggggcgg catccggcgg cagcctgaac 1620 agcgacggca ttgtctatct ggcagatacc gaccaatccg gcgcaaacga aaccgtacac 1680 atcaaaggca gtctgcagct ggacggcaaa ggtacgctgt acacacgttt gggcaaactg 1740 ctgaaagtgg acggtacggc gattatcggc ggcaagctgt acatgtcggc acgcggcaag 1800 ggggcaggct atctcaacag taccggacga cgtgttccct tcctgagtgc cgccaaaatc 1860 gggcaggatt attctttctt cacaaacatc gaaaccgacg gcggcctgct ggcttccctc 1920 gacagcgtcg aaaaaacagc gggcagtgaa ggcgacacgc tgtcctatta tgtccgtcgc 1980 ggcaatgcgg cacggactgc ttcggcagcg gcacattccg cgcccgccgg tctgaaacac 2040 gccgtagaac agggcggcag caatctggaa aacctgatgg tcgaactgga tgcctccgaa 2100 tcatccgcaa cacccgagac ggttgaaact gcggcagccg accgcacaga tatgccgggc 2160 atccgcccct acggcgcaac tttccgcgca gcggcagccg tacagcatgc gaatgccgcc 2220 gacggtgtac gcatcttcaa cagtctcgcc gctaccgtct atgccgacag taccgccgcc 2280 catgccgata tgcagggacg ccgcctgaaa gccgtatcgg acgggttgga ccacaacggc 2340 acgggtctgc gcgtcatcgc gcaaacccaa caggacggtg gaacgtggga acagggcggt 2400 gttgaaggca aaatgcgcgg cagtacccaa accgtcggca ttgccgcgaa aaccggcgaa 2460 aatacgacag cagccgccac actgggcatg ggacgcagca catggagcga aaacagtgca 2520 aatgcaaaaa ccgacagcat tagtctgttt gcaggcatac ggcacgatgc gggcgatatc 2580 ggctatctca aaggcctgtt ctcctacgga cgctacaaaa acagcatcag ccgcagcacc 2640 ggtgcggacg aacatgcgga aggcagcgtc aacggcacgc tgatgcagct gggcgcactg 2700 ggcggtgtca acgttccgtt tgccgcaacg ggagatttga cggtcgaagg cggtctgcgc 2760 tacgacctgc tcaaacagga tgcattcgcc gaaaaaggca gtgctttggg ctggagcggc 2820 aacagcctca ctgaaggcac gctggtcgga ctcgcgggtc tgaagctgtc gcaacccttg 2880 agcgataaag ccgtcctgtt tgcaacggcg ggcgtggaac gcgacctgaa cggacgcgac 2940 tacacggtaa cgggcggctt taccggcgcg actgcagcaa ccggcaagac gggggcacgc 3000 aatatgccgc acacccgtct ggttgccggc ctgggcgcgg atgtcgaatt cggcaacggc 3060 tggaacggct tggcacgtta cagctacgcc ggttccaaac agtacggcaa ccacagcgga 3120 cgagtcggcg taggctaccg gttcctcgag 
ggtggcggag gcactggatc cgccacaaac 3180 gacgacgatg ttaaaaaagc tgccactgtg gccattgctg ctgcctacaa caatggccaa 3240 gaaatcaacg gtttcaaagc tggagagacc atctacgaca ttgatgaaga cggcacaatt 3300 accaaaaaag acgcaactgc agccgatgtt gaagccgacg actttaaagg tctgggtctg 3360 aaaaaagtcg tgactaacct gaccaaaacc gtcaatgaaa acaaacaaaa cgtcgatgcc 3420 aaagtaaaag ctgcagaatc tgaaatagaa aagttaacaa ccaagttagc agacactgat 3480 gccgctttag cagatactga tgccgctctg gatgcaacca ccaacgcctt gaataaattg 3540 ggagaaaata taacgacatt tgctgaagag actaagacaa atatcgtaaa aattgatgaa 3600 aaattagaag ccgtggctga taccgtcgac aagcatgccg aagcattcaa cgatatcgcc 3660 gattcattgg atgaaaccaa cactaaggca gacgaagccg tcaaaaccgc caatgaagcc 3720 aaacagacgg ccgaagaaac caaacaaaac gtcgatgcca aagtaaaagc tgcagaaact 3780 gcagcaggca aagccgaagc tgccgctggc acagctaata ctgcagccga caaggccgaa 3840 gctgtcgctg caaaagttac cgacatcaaa gctgatatcg ctacgaacaa agataatatt 3900 gctaaaaaag caaacagtgc cgacgtgtac accagagaag agtctgacag caaatttgtc 3960 agaattgatg gtctgaacgc tactaccgaa aaattggaca cacgcttggc ttctgctgaa 4020 aaatccattg ccgatcacga tactcgcctg aacggtttgg ataaaacagt gtcagacctg 4080 cgcaaagaaa cccgccaagg ccttgcagaa caagccgcgc tctccggtct gttccaacct 4140 tacaacgtgg gtctcgagca ccaccaccac caccactga 4179 24 1392 PRT Artificial Sequence deltaG983-961c 24 Met Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser 1 5 10 15 Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly 20 25 30 Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg 35 40 45 Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala Pro Pro 50 55 60 Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys 65 70 75 80 Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg 85 90 95 Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile 100 105 110 Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn 115 120 125 Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly 130 135 140 Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val 
Ile Glu 145 150 155 160 Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile Gly His 165 170 175 Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg 180 185 190 Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met Asn Thr 195 200 205 Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala 210 215 220 Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe 225 230 235 240 Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn 245 250 255 Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp 260 265 270 Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn 275 280 285 Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr 290 295 300 Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe 305 310 315 320 Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly Val Asp 325 330 335 Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr 340 345 350 Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala Met Trp 355 360 365 Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn 370 375 380 Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly 385 390 395 400 Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn 405 410 415 Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly 420 425 430 Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met 435 440 445 Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys 450 455 460 Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly Thr 465 470 475 480 Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn 485 490 495 Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu 500 505 510 Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu 515 520 525 Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile 530 535 540 Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr Val His 545 550 555 560 Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr 
Thr Arg 565 570 575 Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly Gly Lys 580 585 590 Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr 595 600 605 Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln Asp Tyr 610 615 620 Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu 625 630 635 640 Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr 645 650 655 Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His 660 665 670 Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn 675 680 685 Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr 690 695 700 Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly 705 710 715 720 Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His 725 730 735 Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr 740 745 750 Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg 755 760 765 Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg 770 775 780 Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly 785 790 795 800 Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala 805 810 815 Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg 820 825 830 Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser 835 840 845 Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys 850 855 860 Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr 865 870 875 880 Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gln 885 890 895 Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp 900 905 910 Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala 915 920 925 Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr 930 935 940 Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu 945 950 955 960 Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg Asp Leu 965 970 975 Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr 
Ala 980 985 990 Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val 995 1000 1005 Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu 1010 1015 1020 Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly 1025 1030 1035 1040 Arg Val Gly Val Gly Tyr Arg Phe Leu Glu Gly Gly Gly Gly Thr Gly 1045 1050 1055 Ser Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile 1060 1065 1070 Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly 1075 1080 1085 Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp 1090 1095 1100 Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu 1105 1110 1115 1120 Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln 1125 1130 1135 Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu 1140 1145 1150 Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 1155 1160 1165 Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile 1170 1175 1180 Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu 1185 1190 1195 1200 Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe 1205 1210 1215 Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 1220 1225 1230 Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys 1235 1240 1245 Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys 1250 1255 1260 Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 1265 1270 1275 1280 Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn 1285 1290 1295 Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg 1300 1305 1310 Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr 1315 1320 1325 Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala 1330 1335 1340 Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu 1345 1350 1355 1360 Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly 1365 1370 1375 Leu Phe Gln Pro Tyr Asn Val Gly Leu Glu His His His His His His 
1380 1385 1390 25 274 PRT Artificial Sequence 741 25 Val Asn Arg Thr Ala Phe Cys Cys Leu Ser Leu Thr Thr Ala Leu Ile 1 5 10 15 Leu Thr Ala Cys Ser Ser Gly Gly Gly Gly Val Ala Ala Asp Ile Gly 20 25 30 Ala Gly Leu Ala Asp Ala Leu Thr Ala Pro Leu Asp His Lys Asp Lys 35 40 45 Gly Leu Gln Ser Leu Thr Leu Asp Gln Ser Val Arg Lys Asn Glu Lys 50 55 60 Leu Lys Leu Ala Ala Gln Gly Ala Glu Lys Thr Tyr Gly Asn Gly Asp 65 70 75 80 Ser Leu Asn Thr Gly Lys Leu Lys Asn Asp Lys Val Ser Arg Phe Asp 85 90 95 Phe Ile Arg Gln Ile Glu Val Asp Gly Gln Leu Ile Thr Leu Glu Ser 100 105 110 Gly Glu Phe Gln Val Tyr Lys Gln Ser His Ser Ala Leu Thr Ala Phe 115 120 125 Gln Thr Glu Gln Ile Gln Asp Ser Glu His Ser Gly Lys Met Val Ala 130 135 140 Lys Arg Gln Phe Arg Ile Gly Asp Ile Ala Gly Glu His Thr Ser Phe 145 150 155 160 Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr Tyr Arg Gly Thr Ala Phe 165 170 175 Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr Tyr Thr Ile Asp Phe Ala 180 185 190 Ala Lys Gln Gly Asn Gly Lys Ile Glu His Leu Lys Ser Pro Glu Leu 195 200 205 Asn Val Asp Leu Ala Ala Ala Asp Ile Lys Pro Asp Gly Lys Arg His 210 215 220 Ala Val Ile Ser Gly Ser Val Leu Tyr Asn Gln Ala Glu Lys Gly Ser 225 230 235 240 Tyr Ser Leu Gly Ile Phe Gly Gly Lys Ala Gln Glu Val Ala Gly Ser 245 250 255 Ala Glu Val Lys Thr Val Asn Gly Ile Arg His Ile Gly Leu Ala Ala 260 265 270 Lys Gln 26 248 PRT Artificial Sequence deltaG741 26 Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala Pro 1 5 10 15 Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln Ser 20 25 30 Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu Lys 35 40 45 Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn Asp 50 55 60 Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly Gln 65 70 75 80 Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser His 85 90 95 Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu His 100 105 110 Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile Ala 115 120 125 Gly Glu His 
Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr 130 135 140 Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr 145 150 155 160 Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu His 165 170 175 Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile Lys 180 185 190 Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr Asn 195 200 205 Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys Ala 210 215 220 Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile Arg 225 230 235 240 His Ile Gly Leu Ala Ala Lys Gln 245 27 1947 DNA Artificial Sequence deltaG741-961 27 atggtcgccg ccgacatcgg tgcggggctt gccgatgcac taaccgcacc gctcgaccat 60 aaagacaaag gtttgcagtc tttgacgctg gatcagtccg tcaggaaaaa cgagaaactg 120 aagctggcgg cacaaggtgc ggaaaaaact tatggaaacg gtgacagcct caatacgggc 180 aaattgaaga acgacaaggt cagccgtttc gactttatcc gccaaatcga agtggacggg 240 cagctcatta ccttggagag tggagagttc caagtataca aacaaagcca ttccgcctta 300 accgcctttc agaccgagca aatacaagat tcggagcatt ccgggaagat ggttgcgaaa 360 cgccagttca gaatcggcga catagcgggc gaacatacat cttttgacaa gcttcccgaa 420 ggcggcaggg cgacatatcg cgggacggcg ttcggttcag acgatgccgg cggaaaactg 480 acctacacca tagatttcgc cgccaagcag ggaaacggca aaatcgaaca tttgaaatcg 540 ccagaactca atgtcgacct ggccgccgcc gatatcaagc cggatggaaa acgccatgcc 600 gtcatcagcg gttccgtcct ttacaaccaa gccgagaaag gcagttactc cctcggtatc 660 tttggcggaa aagcccagga agttgccggc agcgcggaag tgaaaaccgt aaacggcata 720 cgccatatcg gccttgccgc caagcaactc gagggtggcg gaggcactgg atccgccaca 780 aacgacgacg atgttaaaaa agctgccact gtggccattg ctgctgccta caacaatggc 840 caagaaatca acggtttcaa agctggagag accatctacg acattgatga agacggcaca 900 attaccaaaa aagacgcaac tgcagccgat gttgaagccg acgactttaa aggtctgggt 960 ctgaaaaaag tcgtgactaa cctgaccaaa accgtcaatg aaaacaaaca aaacgtcgat 1020 gccaaagtaa aagctgcaga atctgaaata gaaaagttaa caaccaagtt agcagacact 1080 gatgccgctt tagcagatac tgatgccgct ctggatgcaa ccaccaacgc cttgaataaa 1140 ttgggagaaa atataacgac atttgctgaa gagactaaga caaatatcgt 
aaaaattgat 1200 gaaaaattag aagccgtggc tgataccgtc gacaagcatg ccgaagcatt caacgatatc 1260 gccgattcat tggatgaaac caacactaag gcagacgaag ccgtcaaaac cgccaatgaa 1320 gccaaacaga cggccgaaga aaccaaacaa aacgtcgatg ccaaagtaaa agctgcagaa 1380 actgcagcag gcaaagccga agctgccgct ggcacagcta atactgcagc cgacaaggcc 1440 gaagctgtcg ctgcaaaagt taccgacatc aaagctgata tcgctacgaa caaagataat 1500 attgctaaaa aagcaaacag tgccgacgtg tacaccagag aagagtctga cagcaaattt 1560 gtcagaattg atggtctgaa cgctactacc gaaaaattgg acacacgctt ggcttctgct 1620 gaaaaatcca ttgccgatca cgatactcgc ctgaacggtt tggataaaac agtgtcagac 1680 ctgcgcaaag aaacccgcca aggccttgca gaacaagccg cgctctccgg tctgttccaa 1740 ccttacaacg tgggtcggtt caatgtaacg gctgcagtcg gcggctacaa atccgaatcg 1800 gcagtcgcca tcggtaccgg cttccgcttt accgaaaact ttgccgccaa agcaggcgtg 1860 gcagtcggca cttcgtccgg ttcttccgca gcctaccatg tcggcgtcaa ttacgagtgg 1920 ctcgagcacc accaccacca ccactga 1947 28 648 PRT Artificial Sequence deltaG741-961 28 Met Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala 1 5 10 15 Pro Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln 20 25 30 Ser Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu 35 40 45 Lys Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn 50 55 60 Asp Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly 65 70 75 80 Gln Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser 85 90 95 His Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu 100 105 110 His Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile 115 120 125 Ala Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala 130 135 140 Thr Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu 145 150 155 160 Thr Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu 165 170 175 His Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile 180 185 190 Lys Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr 195 200 205 Asn Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys 210 
215 220 Ala Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile 225 230 235 240 Arg His Ile Gly Leu Ala Ala Lys Gln Leu Glu Gly Gly Gly Gly Thr 245 250 255 Gly Ser Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala 260 265 270 Ile Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala 275 280 285 Gly Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys 290 295 300 Asp Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly 305 310 315 320 Leu Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys 325 330 335 Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys 340 345 350 Leu Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp 355 360 365 Ala Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn 370 375 380 Ile Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp 385 390 395 400 Glu Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala 405 410 415 Phe Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp 420 425 430 Glu Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr 435 440 445 Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly 450 455 460 Lys Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala 465 470 475 480 Glu Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr 485 490 495 Asn Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr 500 505 510 Arg Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala 515 520 525 Thr Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile 530 535 540 Ala Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp 545 550 555 560 Leu Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser 565 570 575 Gly Leu Phe Gln Pro Tyr Asn Val Gly Arg Phe Asn Val Thr Ala Ala 580 585 590 Val Gly Gly Tyr Lys Ser Glu Ser Ala Val Ala Ile Gly Thr Gly Phe 595 600 605 Arg Phe Thr Glu Asn Phe Ala Ala Lys Ala Gly Val Ala Val Gly Thr 610 615 620 Ser Ser Gly Ser Ser Ala Ala Tyr His Val Gly Val Asn Tyr Glu Trp 625 630 
635 640 Leu Glu His His His His His His 645 29 1782 DNA Artificial Sequence deltaG741-961c 29 atggtcgccg ccgacatcgg tgcggggctt gccgatgcac taaccgcacc gctcgaccat 60 aaagacaaag gtttgcagtc tttgacgctg gatcagtccg tcaggaaaaa cgagaaactg 120 aagctggcgg cacaaggtgc ggaaaaaact tatggaaacg gtgacagcct caatacgggc 180 aaattgaaga acgacaaggt cagccgtttc gactttatcc gccaaatcga agtggacggg 240 cagctcatta ccttggagag tggagagttc caagtataca aacaaagcca ttccgcctta 300 accgcctttc agaccgagca aatacaagat tcggagcatt ccgggaagat ggttgcgaaa 360 cgccagttca gaatcggcga catagcgggc gaacatacat cttttgacaa gcttcccgaa 420 ggcggcaggg cgacatatcg cgggacggcg ttcggttcag acgatgccgg cggaaaactg 480 acctacacca tagatttcgc cgccaagcag ggaaacggca aaatcgaaca tttgaaatcg 540 ccagaactca atgtcgacct ggccgccgcc gatatcaagc cggatggaaa acgccatgcc 600 gtcatcagcg gttccgtcct ttacaaccaa gccgagaaag gcagttactc cctcggtatc 660 tttggcggaa aagcccagga agttgccggc agcgcggaag tgaaaaccgt aaacggcata 720 cgccatatcg gccttgccgc caagcaactc gagggtggcg gaggcactgg atccgccaca 780 aacgacgacg atgttaaaaa agctgccact gtggccattg ctgctgccta caacaatggc 840 caagaaatca acggtttcaa agctggagag accatctacg acattgatga agacggcaca 900 attaccaaaa aagacgcaac tgcagccgat gttgaagccg acgactttaa aggtctgggt 960 ctgaaaaaag tcgtgactaa cctgaccaaa accgtcaatg aaaacaaaca aaacgtcgat 1020 gccaaagtaa aagctgcaga atctgaaata gaaaagttaa caaccaagtt agcagacact 1080 gatgccgctt tagcagatac tgatgccgct ctggatgcaa ccaccaacgc cttgaataaa 1140 ttgggagaaa atataacgac atttgctgaa gagactaaga caaatatcgt aaaaattgat 1200 gaaaaattag aagccgtggc tgataccgtc gacaagcatg ccgaagcatt caacgatatc 1260 gccgattcat tggatgaaac caacactaag gcagacgaag ccgtcaaaac cgccaatgaa 1320 gccaaacaga cggccgaaga aaccaaacaa aacgtcgatg ccaaagtaaa agctgcagaa 1380 actgcagcag gcaaagccga agctgccgct ggcacagcta atactgcagc cgacaaggcc 1440 gaagctgtcg ctgcaaaagt taccgacatc aaagctgata tcgctacgaa caaagataat 1500 attgctaaaa aagcaaacag tgccgacgtg tacaccagag aagagtctga cagcaaattt 1560 gtcagaattg atggtctgaa cgctactacc gaaaaattgg acacacgctt ggcttctgct 1620 
gaaaaatcca ttgccgatca cgatactcgc ctgaacggtt tggataaaac agtgtcagac 1680 ctgcgcaaag aaacccgcca aggccttgca gaacaagccg cgctctccgg tctgttccaa 1740 ccttacaacg tgggtctcga gcaccaccac caccaccact ga 1782 30 593 PRT Artificial Sequence deltaG741-961c 30 Met Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala 1 5 10 15 Pro Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln 20 25 30 Ser Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu 35 40 45 Lys Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn 50 55 60 Asp Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly 65 70 75 80 Gln Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser 85 90 95 His Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu 100 105 110 His Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile 115 120 125 Ala Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala 130 135 140 Thr Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu 145 150 155 160 Thr Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu 165 170 175 His Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile 180 185 190 Lys Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr 195 200 205 Asn Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys 210 215 220 Ala Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile 225 230 235 240 Arg His Ile Gly Leu Ala Ala Lys Gln Leu Glu Gly Gly Gly Gly Thr 245 250 255 Gly Ser Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala 260 265 270 Ile Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala 275 280 285 Gly Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys 290 295 300 Asp Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly 305 310 315 320 Leu Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys 325 330 335 Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys 340 345 350 Leu Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp 355 360 365 Ala Ala 
Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn 370 375 380 Ile Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp 385 390 395 400 Glu Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala 405 410 415 Phe Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp 420 425 430 Glu Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr 435 440 445 Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly 450 455 460 Lys Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala 465 470 475 480 Glu Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr 485 490 495 Asn Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr 500 505 510 Arg Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala 515 520 525 Thr Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile 530 535 540 Ala Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp 545 550 555 560 Leu Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser 565 570 575 Gly Leu Phe Gln Pro Tyr Asn Val Gly Leu Glu His His His His His 580 585 590 His 31 3939 DNA Artificial Sequence deltaG741-983 31 atggtcgccg ccgacatcgg tgcggggctt gccgatgcac taaccgcacc gctcgaccat 60 aaagacaaag gtttgcagtc tttgacgctg gatcagtccg tcaggaaaaa cgagaaactg 120 aagctggcgg cacaaggtgc ggaaaaaact tatggaaacg gtgacagcct caatacgggc 180 aaattgaaga acgacaaggt cagccgtttc gactttatcc gccaaatcga agtggacggg 240 cagctcatta ccttggagag tggagagttc caagtataca aacaaagcca ttccgcctta 300 accgcctttc agaccgagca aatacaagat tcggagcatt ccgggaagat ggttgcgaaa 360 cgccagttca gaatcggcga catagcgggc gaacatacat cttttgacaa gcttcccgaa 420 ggcggcaggg cgacatatcg cgggacggcg ttcggttcag acgatgccgg cggaaaactg 480 acctacacca tagatttcgc cgccaagcag ggaaacggca aaatcgaaca tttgaaatcg 540 ccagaactca atgtcgacct ggccgccgcc gatatcaagc cggatggaaa acgccatgcc 600 gtcatcagcg gttccgtcct ttacaaccaa gccgagaaag gcagttactc cctcggtatc 660 tttggcggaa aagcccagga agttgccggc agcgcggaag tgaaaaccgt aaacggcata 720 cgccatatcg gccttgccgc caagcaactc 
gagggatccg gcggaggcgg cacttctgcg 780 cccgacttca atgcaggcgg taccggtatc ggcagcaaca gcagagcaac aacagcgaaa 840 tcagcagcag tatcttacgc cggtatcaag aacgaaatgt gcaaagacag aagcatgctc 900 tgtgccggtc gggatgacgt tgcggttaca gacagggatg ccaaaatcaa tgcccccccc 960 ccgaatctgc ataccggaga ctttccaaac ccaaatgacg catacaagaa tttgatcaac 1020 ctcaaacctg caattgaagc aggctataca ggacgcgggg tagaggtagg tatcgtcgac 1080 acaggcgaat ccgtcggcag catatccttt cccgaactgt atggcagaaa agaacacggc 1140 tataacgaaa attacaaaaa ctatacggcg tatatgcgga aggaagcgcc tgaagacgga 1200 ggcggtaaag acattgaagc ttctttcgac gatgaggccg ttatagagac tgaagcaaag 1260 ccgacggata tccgccacgt aaaagaaatc ggacacatcg atttggtctc ccatattatt 1320 ggcgggcgtt ccgtggacgg cagacctgca ggcggtattg cgcccgatgc gacgctacac 1380 ataatgaata cgaatgatga aaccaagaac gaaatgatgg ttgcagccat ccgcaatgca 1440 tgggtcaagc tgggcgaacg tggcgtgcgc atcgtcaata acagttttgg aacaacatcg 1500 agggcaggca ctgccgacct tttccaaata gccaattcgg aggagcagta ccgccaagcg 1560 ttgctcgact attccggcgg tgataaaaca gacgagggta tccgcctgat gcaacagagc 1620 gattacggca acctgtccta ccacatccgt aataaaaaca tgcttttcat cttttcgaca 1680 ggcaatgacg cacaagctca gcccaacaca tatgccctat tgccatttta tgaaaaagac 1740 gctcaaaaag gcattatcac agtcgcaggc gtagaccgca gtggagaaaa gttcaaacgg 1800 gaaatgtatg gagaaccggg tacagaaccg cttgagtatg gctccaacca ttgcggaatt 1860 actgccatgt ggtgcctgtc ggcaccctat gaagcaagcg tccgtttcac ccgtacaaac 1920 ccgattcaaa ttgccggaac atccttttcc gcacccatcg taaccggcac ggcggctctg 1980 ctgctgcaga aatacccgtg gatgagcaac gacaacctgc gtaccacgtt gctgacgacg 2040 gctcaggaca tcggtgcagt cggcgtggac agcaagttcg gctggggact gctggatgcg 2100 ggtaaggcca tgaacggacc cgcgtccttt ccgttcggcg actttaccgc cgatacgaaa 2160 ggtacatccg atattgccta ctccttccgt aacgacattt caggcacggg cggcctgatc 2220 aaaaaaggcg gcagccaact gcaactgcac ggcaacaaca cctatacggg caaaaccatt 2280 atcgaaggcg gttcgctggt gttgtacggc aacaacaaat cggatatgcg cgtcgaaacc 2340 aaaggtgcgc tgatttataa cggggcggca tccggcggca gcctgaacag cgacggcatt 2400 gtctatctgg cagataccga ccaatccggc gcaaacgaaa 
ccgtacacat caaaggcagt 2460 ctgcagctgg acggcaaagg tacgctgtac acacgtttgg gcaaactgct gaaagtggac 2520 ggtacggcga ttatcggcgg caagctgtac atgtcggcac gcggcaaggg ggcaggctat 2580 ctcaacagta ccggacgacg tgttcccttc ctgagtgccg ccaaaatcgg gcaggattat 2640 tctttcttca caaacatcga aaccgacggc ggcctgctgg cttccctcga cagcgtcgaa 2700 aaaacagcgg gcagtgaagg cgacacgctg tcctattatg tccgtcgcgg caatgcggca 2760 cggactgctt cggcagcggc acattccgcg cccgccggtc tgaaacacgc cgtagaacag 2820 ggcggcagca atctggaaaa cctgatggtc gaactggatg cctccgaatc atccgcaaca 2880 cccgagacgg ttgaaactgc ggcagccgac cgcacagata tgccgggcat ccgcccctac 2940 ggcgcaactt tccgcgcagc ggcagccgta cagcatgcga atgccgccga cggtgtacgc 3000 atcttcaaca gtctcgccgc taccgtctat gccgacagta ccgccgccca tgccgatatg 3060 cagggacgcc gcctgaaagc cgtatcggac gggttggacc acaacggcac gggtctgcgc 3120 gtcatcgcgc aaacccaaca ggacggtgga acgtgggaac agggcggtgt tgaaggcaaa 3180 atgcgcggca gtacccaaac cgtcggcatt gccgcgaaaa ccggcgaaaa tacgacagca 3240 gccgccacac tgggcatggg acgcagcaca tggagcgaaa acagtgcaaa tgcaaaaacc 3300 gacagcatta gtctgtttgc aggcatacgg cacgatgcgg gcgatatcgg ctatctcaaa 3360 ggcctgttct cctacggacg ctacaaaaac agcatcagcc gcagcaccgg tgcggacgaa 3420 catgcggaag gcagcgtcaa cggcacgctg atgcagctgg gcgcactggg cggtgtcaac 3480 gttccgtttg ccgcaacggg agatttgacg gtcgaaggcg gtctgcgcta cgacctgctc 3540 aaacaggatg cattcgccga aaaaggcagt gctttgggct ggagcggcaa cagcctcact 3600 gaaggcacgc tggtcggact cgcgggtctg aagctgtcgc aacccttgag cgataaagcc 3660 gtcctgtttg caacggcggg cgtggaacgc gacctgaacg gacgcgacta cacggtaacg 3720 ggcggcttta ccggcgcgac tgcagcaacc ggcaagacgg gggcacgcaa tatgccgcac 3780 acccgtctgg ttgccggcct gggcgcggat gtcgaattcg gcaacggctg gaacggcttg 3840 gcacgttaca gctacgccgg ttccaaacag tacggcaacc acagcggacg agtcggcgta 3900 ggctaccggt tcctcgagca ccaccaccac caccactga 3939 32 1312 PRT Artificial Sequence deltaG741-983 32 Met Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala 1 5 10 15 Pro Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln 20 25 30 Ser Val Arg Lys Asn Glu 
Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu 35 40 45 Lys Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn 50 55 60 Asp Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly 65 70 75 80 Gln Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser 85 90 95 His Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu 100 105 110 His Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile 115 120 125 Ala Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala 130 135 140 Thr Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu 145 150 155 160 Thr Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu 165 170 175 His Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile 180 185 190 Lys Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr 195 200 205 Asn Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys 210 215 220 Ala Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile 225 230 235 240 Arg His Ile Gly Leu Ala Ala Lys Gln Leu Glu Gly Ser Gly Gly Gly 245 250 255 Gly Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser 260 265 270 Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly 275 280 285 Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg 290 295 300 Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala Pro Pro 305 310 315 320 Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys 325 330 335 Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg 340 345 350 Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile 355 360 365 Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn 370 375 380 Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly 385 390 395 400 Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val Ile Glu 405 410 415 Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile Gly His 420 425 430 Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg 435 440 445 Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr 
Leu His Ile Met Asn Thr 450 455 460 Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala 465 470 475 480 Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe 485 490 495 Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn 500 505 510 Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp 515 520 525 Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn 530 535 540 Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr 545 550 555 560 Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe 565 570 575 Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly Val Asp 580 585 590 Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr 595 600 605 Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala Met Trp 610 615 620 Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn 625 630 635 640 Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly 645 650 655 Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn 660 665 670 Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly 675 680 685 Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met 690 695 700 Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys 705 710 715 720 Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly Thr 725 730 735 Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn 740 745 750 Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu 755 760 765 Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu 770 775 780 Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile 785 790 795 800 Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr Val His 805 810 815 Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg 820 825 830 Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly Gly Lys 835 840 845 Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr 850 855 860 Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys 
Ile Gly Gln Asp Tyr 865 870 875 880 Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu 885 890 895 Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr 900 905 910 Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His 915 920 925 Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn 930 935 940 Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr 945 950 955 960 Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly 965 970 975 Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His 980 985 990 Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr 995 1000 1005 Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg 1010 1015 1020 Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg 1025 1030 1035 1040 Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly 1045 1050 1055 Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala 1060 1065 1070 Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg 1075 1080 1085 Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser 1090 1095 1100 Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys 1105 1110 1115 1120 Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr 1125 1130 1135 Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gln 1140 1145 1150 Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp 1155 1160 1165 Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala 1170 1175 1180 Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr 1185 1190 1195 1200 Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu 1205 1210 1215 Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg Asp Leu 1220 1225 1230 Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala 1235 1240 1245 Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val 1250 1255 1260 Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu 1265 
1270 1275 1280 Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly 1285 1290 1295 Arg Val Gly Val Gly Tyr Arg Phe Leu Glu His His His His His His 1300 1305 1310 33 2028 DNA Artificial Sequence deltaG741-ORF46.1 33 atggtcgccg ccgacatcgg tgcggggctt gccgatgcac taaccgcacc gctcgaccat 60 aaagacaaag gtttgcagtc tttgacgctg gatcagtccg tcaggaaaaa cgagaaactg 120 aagctggcgg cacaaggtgc ggaaaaaact tatggaaacg gtgacagcct caatacgggc 180 aaattgaaga acgacaaggt cagccgtttc gactttatcc gccaaatcga agtggacggg 240 cagctcatta ccttggagag tggagagttc caagtataca aacaaagcca ttccgcctta 300 accgcctttc agaccgagca aatacaagat tcggagcatt ccgggaagat ggttgcgaaa 360 cgccagttca gaatcggcga catagcgggc gaacatacat cttttgacaa gcttcccgaa 420 ggcggcaggg cgacatatcg cgggacggcg ttcggttcag acgatgccgg cggaaaactg 480 acctacacca tagatttcgc cgccaagcag ggaaacggca aaatcgaaca tttgaaatcg 540 ccagaactca atgtcgacct ggccgccgcc gatatcaagc cggatggaaa acgccatgcc 600 gtcatcagcg gttccgtcct ttacaaccaa gccgagaaag gcagttactc cctcggtatc 660 tttggcggaa aagcccagga agttgccggc agcgcggaag tgaaaaccgt aaacggcata 720 cgccatatcg gccttgccgc caagcaactc gacggtggcg gaggcactgg atcctcagat 780 ttggcaaacg attcttttat ccggcaggtt ctcgaccgtc agcatttcga acccgacggg 840 aaataccacc tattcggcag caggggggaa cttgccgagc gcagcggcca tatcggattg 900 ggaaaaatac aaagccatca gttgggcaac ctgatgattc aacaggcggc cattaaagga 960 aatatcggct acattgtccg cttttccgat cacgggcacg aagtccattc ccccttcgac 1020 aaccatgcct cacattccga ttctgatgaa gccggtagtc ccgttgacgg atttagcctt 1080 taccgcatcc attgggacgg atacgaacac catcccgccg acggctatga cgggccacag 1140 ggcggcggct atcccgctcc caaaggcgcg agggatatat acagctacga cataaaaggc 1200 gttgcccaaa atatccgcct caacctgacc gacaaccgca gcaccggaca acggcttgcc 1260 gaccgtttcc acaatgccgg tagtatgctg acgcaaggag taggcgacgg attcaaacgc 1320 gccacccgat acagccccga gctggacaga tcgggcaatg ccgccgaagc cttcaacggc 1380 actgcagata tcgttaaaaa catcatcggc gcggcaggag aaattgtcgg cgcaggcgat 1440 gccgtgcagg gcataagcga aggctcaaac attgctgtca tgcacggctt gggtctgctt 1500 tccaccgaaa 
acaagatggc gcgcatcaac gatttggcag atatggcgca actcaaagac 1560 tatgccgcag cagccatccg cgattgggca gtccaaaacc ccaatgccgc acaaggcata 1620 gaagccgtca gcaatatctt tatggcagcc atccccatca aagggattgg agctgttcgg 1680 ggaaaatacg gcttgggcgg catcacggca catcctatca agcggtcgca gatgggcgcg 1740 atcgcattgc cgaaagggaa atccgccgtc agcgacaatt ttgccgatgc ggcatacgcc 1800 aaatacccgt ccccttacca ttcccgaaat atccgttcaa acttggagca gcgttacggc 1860 aaagaaaaca tcacctcctc aaccgtgccg ccgtcaaacg gcaaaaatgt caaactggca 1920 gaccaacgcc acccgaagac aggcgtaccg tttgacggta aagggtttcc gaattttgag 1980 aagcacgtga aatatgatac gctcgagcac caccaccacc accactga 2028 34 675 PRT Artificial Sequence deltaG741-ORF46.1 34 Met Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala 1 5 10 15 Pro Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln 20 25 30 Ser Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu 35 40 45 Lys Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn 50 55 60 Asp Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly 65 70 75 80 Gln Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser 85 90 95 His Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu 100 105 110 His Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile 115 120 125 Ala Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala 130 135 140 Thr Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu 145 150 155 160 Thr Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu 165 170 175 His Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile 180 185 190 Lys Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr 195 200 205 Asn Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys 210 215 220 Ala Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile 225 230 235 240 Arg His Ile Gly Leu Ala Ala Lys Gln Leu Asp Gly Gly Gly Gly Thr 245 250 255 Gly Ser Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln Val Leu Asp 260 265 270 Arg Gln His Phe Glu Pro Asp Gly Lys Tyr His 
Leu Phe Gly Ser Arg 275 280 285 Gly Glu Leu Ala Glu Arg Ser Gly His Ile Gly Leu Gly Lys Ile Gln 290 295 300 Ser His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala Ile Lys Gly 305 310 315 320 Asn Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His Glu Val His 325 330 335 Ser Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly 340 345 350 Ser Pro Val Asp Gly Phe Ser Leu Tyr Arg Ile His Trp Asp Gly Tyr 355 360 365 Glu His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly Gly Tyr 370 375 380 Pro Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile Lys Gly 385 390 395 400 Val Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly 405 410 415 Gln Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gln 420 425 430 Gly Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro Glu Leu 435 440 445 Asp Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp Ile 450 455 460 Val Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly Ala Gly Asp 465 470 475 480 Ala Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val Met His Gly 485 490 495 Leu Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile Asn Asp Leu 500 505 510 Ala Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile Arg Asp 515 520 525 Trp Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala Val Ser 530 535 540 Asn Ile Phe Met Ala Ala Ile Pro Ile Lys Gly Ile Gly Ala Val Arg 545 550 555 560 Gly Lys Tyr Gly Leu Gly Gly Ile Thr Ala His Pro Ile Lys Arg Ser 565 570 575 Gln Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp 580 585 590 Asn Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser 595 600 605 Arg Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu Asn Ile 610 615 620 Thr Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val Lys Leu Ala 625 630 635 640 Asp Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe 645 650 655 Pro Asn Phe Glu Lys His Val Lys Tyr Asp Thr Leu Glu His His His 660 665 670 His His His 675 35 2019 DNA Artificial Sequence ORF46.1-741 35 atgtcagatt tggcaaacga ttcttttatc cggcaggttc tcgaccgtca 
gcatttcgaa 60 cccgacggga aataccacct attcggcagc aggggggaac ttgccgagcg cagcggccat 120 atcggattgg gaaaaataca aagccatcag ttgggcaacc tgatgattca acaggcggcc 180 attaaaggaa atatcggcta cattgtccgc ttttccgatc acgggcacga agtccattcc 240 cccttcgaca accatgcctc acattccgat tctgatgaag ccggtagtcc cgttgacgga 300 tttagccttt accgcatcca ttgggacgga tacgaacacc atcccgccga cggctatgac 360 gggccacagg gcggcggcta tcccgctccc aaaggcgcga gggatatata cagctacgac 420 ataaaaggcg ttgcccaaaa tatccgcctc aacctgaccg acaaccgcag caccggacaa 480 cggcttgccg accgtttcca caatgccggt agtatgctga cgcaaggagt aggcgacgga 540 ttcaaacgcg ccacccgata cagccccgag ctggacagat cgggcaatgc cgccgaagcc 600 ttcaacggca ctgcagatat cgttaaaaac atcatcggcg cggcaggaga aattgtcggc 660 gcaggcgatg ccgtgcaggg cataagcgaa ggctcaaaca ttgctgtcat gcacggcttg 720 ggtctgcttt ccaccgaaaa caagatggcg cgcatcaacg atttggcaga tatggcgcaa 780 ctcaaagact atgccgcagc agccatccgc gattgggcag tccaaaaccc caatgccgca 840 caaggcatag aagccgtcag caatatcttt atggcagcca tccccatcaa agggattgga 900 gctgttcggg gaaaatacgg cttgggcggc atcacggcac atcctatcaa gcggtcgcag 960 atgggcgcga tcgcattgcc gaaagggaaa tccgccgtca gcgacaattt tgccgatgcg 1020 gcatacgcca aatacccgtc cccttaccat tcccgaaata tccgttcaaa cttggagcag 1080 cgttacggca aagaaaacat cacctcctca accgtgccgc cgtcaaacgg caaaaatgtc 1140 aaactggcag accaacgcca cccgaagaca ggcgtaccgt ttgacggtaa agggtttccg 1200 aattttgaga agcacgtgaa atatgatacg ggatccggag ggggtggtgt cgccgccgac 1260 atcggtgcgg ggcttgccga tgcactaacc gcaccgctcg accataaaga caaaggtttg 1320 cagtctttga cgctggatca gtccgtcagg aaaaacgaga aactgaagct ggcggcacaa 1380 ggtgcggaaa aaacttatgg aaacggtgac agcctcaata cgggcaaatt gaagaacgac 1440 aaggtcagcc gtttcgactt tatccgccaa atcgaagtgg acgggcagct cattaccttg 1500 gagagtggag agttccaagt atacaaacaa agccattccg ccttaaccgc ctttcagacc 1560 gagcaaatac aagattcgga gcattccggg aagatggttg cgaaacgcca gttcagaatc 1620 ggcgacatag cgggcgaaca tacatctttt gacaagcttc ccgaaggcgg cagggcgaca 1680 tatcgcggga cggcgttcgg ttcagacgat gccggcggaa aactgaccta caccatagat 1740 ttcgccgcca 
agcagggaaa cggcaaaatc gaacatttga aatcgccaga actcaatgtc 1800 gacctggccg ccgccgatat caagccggat ggaaaacgcc atgccgtcat cagcggttcc 1860 gtcctttaca accaagccga gaaaggcagt tactccctcg gtatctttgg cggaaaagcc 1920 caggaagttg ccggcagcgc ggaagtgaaa accgtaaacg gcatacgcca tatcggcctt 1980 gccgccaagc aactcgagca ccaccaccac caccactga 2019 36 672 PRT Artificial Sequence ORF46.1-741 36 Met Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln Val Leu Asp Arg 1 5 10 15 Gln His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly Ser Arg Gly 20 25 30 Glu Leu Ala Glu Arg Ser Gly His Ile Gly Leu Gly Lys Ile Gln Ser 35 40 45 His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala Ile Lys Gly Asn 50 55 60 Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His Glu Val His Ser 65 70 75 80 Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly Ser 85 90 95 Pro Val Asp Gly Phe Ser Leu Tyr Arg Ile His Trp Asp Gly Tyr Glu 100 105 110 His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly Gly Tyr Pro 115 120 125 Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile Lys Gly Val 130 135 140 Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly Gln 145 150 155 160 Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gln Gly 165 170 175 Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro Glu Leu Asp 180 185 190 Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp Ile Val 195 200 205 Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly Ala Gly Asp Ala 210 215 220 Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val Met His Gly Leu 225 230 235 240 Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile Asn Asp Leu Ala 245 250 255 Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile Arg Asp Trp 260 265 270 Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala Val Ser Asn 275 280 285 Ile Phe Met Ala Ala Ile Pro Ile Lys Gly Ile Gly Ala Val Arg Gly 290 295 300 Lys Tyr Gly Leu Gly Gly Ile Thr Ala His Pro Ile Lys Arg Ser Gln 305 310 315 320 Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp Asn 325 330 335 Phe Ala Asp Ala Ala Tyr Ala Lys Tyr 
Pro Ser Pro Tyr His Ser Arg 340 345 350 Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu Asn Ile Thr 355 360 365 Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val Lys Leu Ala Asp 370 375 380 Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe Pro 385 390 395 400 Asn Phe Glu Lys His Val Lys Tyr Asp Thr Gly Ser Gly Gly Gly Gly 405 410 415 Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala Pro 420 425 430 Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln Ser 435 440 445 Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu Lys 450 455 460 Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn Asp 465 470 475 480 Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly Gln 485 490 495 Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser His 500 505 510 Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu His 515 520 525 Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile Ala 530 535 540 Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr 545 550 555 560 Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr 565 570 575 Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu His 580 585 590 Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile Lys 595 600 605 Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr Asn 610 615 620 Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys Ala 625 630 635 640 Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile Arg 645 650 655 His Ile Gly Leu Ala Ala Lys Gln Leu Glu His His His His His His 660 665 670 37 2421 DNA Artificial Sequence ORF46.1-961 37 atgtcagatt tggcaaacga ttcttttatc cggcaggttc tcgaccgtca gcatttcgaa 60 cccgacggga aataccacct attcggcagc aggggggaac ttgccgagcg cagcggccat 120 atcggattgg gaaaaataca aagccatcag ttgggcaacc tgatgattca acaggcggcc 180 attaaaggaa atatcggcta cattgtccgc ttttccgatc acgggcacga agtccattcc 240 cccttcgaca accatgcctc acattccgat tctgatgaag ccggtagtcc cgttgacgga 300 tttagccttt accgcatcca 
ttgggacgga tacgaacacc atcccgccga cggctatgac 360 gggccacagg gcggcggcta tcccgctccc aaaggcgcga gggatatata cagctacgac 420 ataaaaggcg ttgcccaaaa tatccgcctc aacctgaccg acaaccgcag caccggacaa 480 cggcttgccg accgtttcca caatgccggt agtatgctga cgcaaggagt aggcgacgga 540 ttcaaacgcg ccacccgata cagccccgag ctggacagat cgggcaatgc cgccgaagcc 600 ttcaacggca ctgcagatat cgttaaaaac atcatcggcg cggcaggaga aattgtcggc 660 gcaggcgatg ccgtgcaggg cataagcgaa ggctcaaaca ttgctgtcat gcacggcttg 720 ggtctgcttt ccaccgaaaa caagatggcg cgcatcaacg atttggcaga tatggcgcaa 780 ctcaaagact atgccgcagc agccatccgc gattgggcag tccaaaaccc caatgccgca 840 caaggcatag aagccgtcag caatatcttt atggcagcca tccccatcaa agggattgga 900 gctgttcggg gaaaatacgg cttgggcggc atcacggcac atcctatcaa gcggtcgcag 960 atgggcgcga tcgcattgcc gaaagggaaa tccgccgtca gcgacaattt tgccgatgcg 1020 gcatacgcca aatacccgtc cccttaccat tcccgaaata tccgttcaaa cttggagcag 1080 cgttacggca aagaaaacat cacctcctca accgtgccgc cgtcaaacgg caaaaatgtc 1140 aaactggcag accaacgcca cccgaagaca ggcgtaccgt ttgacggtaa agggtttccg 1200 aattttgaga agcacgtgaa atatgatacg ggatccggag gaggaggagc cacaaacgac 1260 gacgatgtta aaaaagctgc cactgtggcc attgctgctg cctacaacaa tggccaagaa 1320 atcaacggtt tcaaagctgg agagaccatc tacgacattg atgaagacgg cacaattacc 1380 aaaaaagacg caactgcagc cgatgttgaa gccgacgact ttaaaggtct gggtctgaaa 1440 aaagtcgtga ctaacctgac caaaaccgtc aatgaaaaca aacaaaacgt cgatgccaaa 1500 gtaaaagctg cagaatctga aatagaaaag ttaacaacca agttagcaga cactgatgcc 1560 gctttagcag atactgatgc cgctctggat gcaaccacca acgccttgaa taaattggga 1620 gaaaatataa cgacatttgc tgaagagact aagacaaata tcgtaaaaat tgatgaaaaa 1680 ttagaagccg tggctgatac cgtcgacaag catgccgaag cattcaacga tatcgccgat 1740 tcattggatg aaaccaacac taaggcagac gaagccgtca aaaccgccaa tgaagccaaa 1800 cagacggccg aagaaaccaa acaaaacgtc gatgccaaag taaaagctgc agaaactgca 1860 gcaggcaaag ccgaagctgc cgctggcaca gctaatactg cagccgacaa ggccgaagct 1920 gtcgctgcaa aagttaccga catcaaagct gatatcgcta cgaacaaaga taatattgct 1980 aaaaaagcaa acagtgccga cgtgtacacc agagaagagt 
ctgacagcaa atttgtcaga 2040 attgatggtc tgaacgctac taccgaaaaa ttggacacac gcttggcttc tgctgaaaaa 2100 tccattgccg atcacgatac tcgcctgaac ggtttggata aaacagtgtc agacctgcgc 2160 aaagaaaccc gccaaggcct tgcagaacaa gccgcgctct ccggtctgtt ccaaccttac 2220 aacgtgggtc ggttcaatgt aacggctgca gtcggcggct acaaatccga atcggcagtc 2280 gccatcggta ccggcttccg ctttaccgaa aactttgccg ccaaagcagg cgtggcagtc 2340 ggcacttcgt ccggttcttc cgcagcctac catgtcggcg tcaattacga gtggctcgag 2400 caccaccacc accaccactg a 2421 38 806 PRT Artificial Sequence ORF46.1-961 38 Met Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln Val Leu Asp Arg 1 5 10 15 Gln His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly Ser Arg Gly 20 25 30 Glu Leu Ala Glu Arg Ser Gly His Ile Gly Leu Gly Lys Ile Gln Ser 35 40 45 His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala Ile Lys Gly Asn 50 55 60 Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His Glu Val His Ser 65 70 75 80 Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly Ser 85 90 95 Pro Val Asp Gly Phe Ser Leu Tyr Arg Ile His Trp Asp Gly Tyr Glu 100 105 110 His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly Gly Tyr Pro 115 120 125 Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile Lys Gly Val 130 135 140 Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly Gln 145 150 155 160 Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gln Gly 165 170 175 Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro Glu Leu Asp 180 185 190 Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp Ile Val 195 200 205 Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly Ala Gly Asp Ala 210 215 220 Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val Met His Gly Leu 225 230 235 240 Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile Asn Asp Leu Ala 245 250 255 Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile Arg Asp Trp 260 265 270 Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala Val Ser Asn 275 280 285 Ile Phe Met Ala Ala Ile Pro Ile Lys Gly Ile Gly Ala Val Arg Gly 290 295 300 Lys Tyr Gly Leu Gly Gly Ile Thr 
Ala His Pro Ile Lys Arg Ser Gln 305 310 315 320 Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp Asn 325 330 335 Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser Arg 340 345 350 Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu Asn Ile Thr 355 360 365 Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val Lys Leu Ala Asp 370 375 380 Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe Pro 385 390 395 400 Asn Phe Glu Lys His Val Lys Tyr Asp Thr Gly Ser Gly Gly Gly Gly 405 410 415 Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile Ala 420 425 430 Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly Glu 435 440 445 Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp Ala 450 455 460 Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu Lys 465 470 475 480 Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln Asn 485 490 495 Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu Thr 500 505 510 Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala Ala 515 520 525 Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile Thr 530 535 540 Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu Lys 545 550 555 560 Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe Asn 565 570 575 Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu Ala 580 585 590 Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys Gln 595 600 605 Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys Ala 610 615 620 Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu Ala 625 630 635 640 Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn Lys 645 650 655 Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg Glu 660 665 670 Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr Thr 675 680 685 Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala Asp 690 695 700 His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu Arg 705 710 715 720 Lys Glu Thr Arg Gln Gly Leu Ala 
Glu Gln Ala Ala Leu Ser Gly Leu 725 730 735 Phe Gln Pro Tyr Asn Val Gly Arg Phe Asn Val Thr Ala Ala Val Gly 740 745 750 Gly Tyr Lys Ser Glu Ser Ala Val Ala Ile Gly Thr Gly Phe Arg Phe 755 760 765 Thr Glu Asn Phe Ala Ala Lys Ala Gly Val Ala Val Gly Thr Ser Ser 770 775 780 Gly Ser Ser Ala Ala Tyr His Val Gly Val Asn Tyr Glu Trp Leu Glu 785 790 795 800 His His His His His His 805 39 2256 DNA Artificial Sequence ORF46.1-961c 39 atgtcagatt tggcaaacga ttcttttatc cggcaggttc tcgaccgtca gcatttcgaa 60 cccgacggga aataccacct attcggcagc aggggggaac ttgccgagcg cagcggccat 120 atcggattgg gaaaaataca aagccatcag ttgggcaacc tgatgattca acaggcggcc 180 attaaaggaa atatcggcta cattgtccgc ttttccgatc acgggcacga agtccattcc 240 cccttcgaca accatgcctc acattccgat tctgatgaag ccggtagtcc cgttgacgga 300 tttagccttt accgcatcca ttgggacgga tacgaacacc atcccgccga cggctatgac 360 gggccacagg gcggcggcta tcccgctccc aaaggcgcga gggatatata cagctacgac 420 ataaaaggcg ttgcccaaaa tatccgcctc aacctgaccg acaaccgcag caccggacaa 480 cggcttgccg accgtttcca caatgccggt agtatgctga cgcaaggagt aggcgacgga 540 ttcaaacgcg ccacccgata cagccccgag ctggacagat cgggcaatgc cgccgaagcc 600 ttcaacggca ctgcagatat cgttaaaaac atcatcggcg cggcaggaga aattgtcggc 660 gcaggcgatg ccgtgcaggg cataagcgaa ggctcaaaca ttgctgtcat gcacggcttg 720 ggtctgcttt ccaccgaaaa caagatggcg cgcatcaacg atttggcaga tatggcgcaa 780 ctcaaagact atgccgcagc agccatccgc gattgggcag tccaaaaccc caatgccgca 840 caaggcatag aagccgtcag caatatcttt atggcagcca tccccatcaa agggattgga 900 gctgttcggg gaaaatacgg cttgggcggc atcacggcac atcctatcaa gcggtcgcag 960 atgggcgcga tcgcattgcc gaaagggaaa tccgccgtca gcgacaattt tgccgatgcg 1020 gcatacgcca aatacccgtc cccttaccat tcccgaaata tccgttcaaa cttggagcag 1080 cgttacggca aagaaaacat cacctcctca accgtgccgc cgtcaaacgg caaaaatgtc 1140 aaactggcag accaacgcca cccgaagaca ggcgtaccgt ttgacggtaa agggtttccg 1200 aattttgaga agcacgtgaa atatgatacg ggatccggag gaggaggagc cacaaacgac 1260 gacgatgtta aaaaagctgc cactgtggcc attgctgctg cctacaacaa tggccaagaa 1320 atcaacggtt tcaaagctgg 
agagaccatc tacgacattg atgaagacgg cacaattacc 1380 aaaaaagacg caactgcagc cgatgttgaa gccgacgact ttaaaggtct gggtctgaaa 1440 aaagtcgtga ctaacctgac caaaaccgtc aatgaaaaca aacaaaacgt cgatgccaaa 1500 gtaaaagctg cagaatctga aatagaaaag ttaacaacca agttagcaga cactgatgcc 1560 gctttagcag atactgatgc cgctctggat gcaaccacca acgccttgaa taaattggga 1620 gaaaatataa cgacatttgc tgaagagact aagacaaata tcgtaaaaat tgatgaaaaa 1680 ttagaagccg tggctgatac cgtcgacaag catgccgaag cattcaacga tatcgccgat 1740 tcattggatg aaaccaacac taaggcagac gaagccgtca aaaccgccaa tgaagccaaa 1800 cagacggccg aagaaaccaa acaaaacgtc gatgccaaag taaaagctgc agaaactgca 1860 gcaggcaaag ccgaagctgc cgctggcaca gctaatactg cagccgacaa ggccgaagct 1920 gtcgctgcaa aagttaccga catcaaagct gatatcgcta cgaacaaaga taatattgct 1980 aaaaaagcaa acagtgccga cgtgtacacc agagaagagt ctgacagcaa atttgtcaga 2040 attgatggtc tgaacgctac taccgaaaaa ttggacacac gcttggcttc tgctgaaaaa 2100 tccattgccg atcacgatac tcgcctgaac ggtttggata aaacagtgtc agacctgcgc 2160 aaagaaaccc gccaaggcct tgcagaacaa gccgcgctct ccggtctgtt ccaaccttac 2220 aacgtgggtc tcgagcacca ccaccaccac cactga 2256 40 751 PRT Artificial Sequence ORF46.1-961c 40 Met Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln Val Leu Asp Arg 1 5 10 15 Gln His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly Ser Arg Gly 20 25 30 Glu Leu Ala Glu Arg Ser Gly His Ile Gly Leu Gly Lys Ile Gln Ser 35 40 45 His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala Ile Lys Gly Asn 50 55 60 Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His Glu Val His Ser 65 70 75 80 Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly Ser 85 90 95 Pro Val Asp Gly Phe Ser Leu Tyr Arg Ile His Trp Asp Gly Tyr Glu 100 105 110 His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly Gly Tyr Pro 115 120 125 Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile Lys Gly Val 130 135 140 Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly Gln 145 150 155 160 Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gln Gly 165 170 175 Val Gly Asp Gly Phe Lys Arg Ala Thr 
Arg Tyr Ser Pro Glu Leu Asp 180 185 190 Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp Ile Val 195 200 205 Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly Ala Gly Asp Ala 210 215 220 Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val Met His Gly Leu 225 230 235 240 Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile Asn Asp Leu Ala 245 250 255 Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile Arg Asp Trp 260 265 270 Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala Val Ser Asn 275 280 285 Ile Phe Met Ala Ala Ile Pro Ile Lys Gly Ile Gly Ala Val Arg Gly 290 295 300 Lys Tyr Gly Leu Gly Gly Ile Thr Ala His Pro Ile Lys Arg Ser Gln 305 310 315 320 Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp Asn 325 330 335 Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser Arg 340 345 350 Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu Asn Ile Thr 355 360 365 Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val Lys Leu Ala Asp 370 375 380 Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe Pro 385 390 395 400 Asn Phe Glu Lys His Val Lys Tyr Asp Thr Gly Ser Gly Gly Gly Gly 405 410 415 Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile Ala 420 425 430 Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly Glu 435 440 445 Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp Ala 450 455 460 Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu Lys 465 470 475 480 Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln Asn 485 490 495 Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu Thr 500 505 510 Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala Ala 515 520 525 Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile Thr 530 535 540 Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu Lys 545 550 555 560 Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe Asn 565 570 575 Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu Ala 580 585 590 Val Lys Thr Ala Asn Glu Ala Lys Gln Thr 
Ala Glu Glu Thr Lys Gln 595 600 605 Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys Ala 610 615 620 Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu Ala 625 630 635 640 Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn Lys 645 650 655 Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg Glu 660 665 670 Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr Thr 675 680 685 Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala Asp 690 695 700 His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu Arg 705 710 715 720 Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly Leu 725 730 735 Phe Gln Pro Tyr Asn Val Gly Leu Glu His His His His His His 740 745 750 41 2421 DNA Artificial Sequence 961-ORF46.1 41 atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60 aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120 gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180 ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240 aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300 gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360 ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420 aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480 aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540 gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600 gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660 gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720 aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780 agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 840 gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900 gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960 ctgttccaac cttacaacgt gggtcggttc aatgtaacgg ctgcagtcgg cggctacaaa 1020 tccgaatcgg cagtcgccat cggtaccggc 
ttccgcttta ccgaaaactt tgccgccaaa 1080 gcaggcgtgg cagtcggcac ttcgtccggt tcttccgcag cctaccatgt cggcgtcaat 1140 tacgagtggg gatccggagg aggaggatca gatttggcaa acgattcttt tatccggcag 1200 gttctcgacc gtcagcattt cgaacccgac gggaaatacc acctattcgg cagcaggggg 1260 gaacttgccg agcgcagcgg ccatatcgga ttgggaaaaa tacaaagcca tcagttgggc 1320 aacctgatga ttcaacaggc ggccattaaa ggaaatatcg gctacattgt ccgcttttcc 1380 gatcacgggc acgaagtcca ttcccccttc gacaaccatg cctcacattc cgattctgat 1440 gaagccggta gtcccgttga cggatttagc ctttaccgca tccattggga cggatacgaa 1500 caccatcccg ccgacggcta tgacgggcca cagggcggcg gctatcccgc tcccaaaggc 1560 gcgagggata tatacagcta cgacataaaa ggcgttgccc aaaatatccg cctcaacctg 1620 accgacaacc gcagcaccgg acaacggctt gccgaccgtt tccacaatgc cggtagtatg 1680 ctgacgcaag gagtaggcga cggattcaaa cgcgccaccc gatacagccc cgagctggac 1740 agatcgggca atgccgccga agccttcaac ggcactgcag atatcgttaa aaacatcatc 1800 ggcgcggcag gagaaattgt cggcgcaggc gatgccgtgc agggcataag cgaaggctca 1860 aacattgctg tcatgcacgg cttgggtctg ctttccaccg aaaacaagat ggcgcgcatc 1920 aacgatttgg cagatatggc gcaactcaaa gactatgccg cagcagccat ccgcgattgg 1980 gcagtccaaa accccaatgc cgcacaaggc atagaagccg tcagcaatat ctttatggca 2040 gccatcccca tcaaagggat tggagctgtt cggggaaaat acggcttggg cggcatcacg 2100 gcacatccta tcaagcggtc gcagatgggc gcgatcgcat tgccgaaagg gaaatccgcc 2160 gtcagcgaca attttgccga tgcggcatac gccaaatacc cgtcccctta ccattcccga 2220 aatatccgtt caaacttgga gcagcgttac ggcaaagaaa acatcacctc ctcaaccgtg 2280 ccgccgtcaa acggcaaaaa tgtcaaactg gcagaccaac gccacccgaa gacaggcgta 2340 ccgtttgacg gtaaagggtt tccgaatttt gagaagcacg tgaaatatga tacgctcgag 2400 caccaccacc accaccactg a 2421 42 806 PRT Artificial Sequence 961-ORF46.1 42 Met Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile 1 5 10 15 Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly 20 25 30 Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp 35 40 45 Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu 50 55 60 Lys Lys Val Val Thr Asn Leu Thr 
Lys Thr Val Asn Glu Asn Lys Gln 65 70 75 80 Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu 85 90 95 Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 100 105 110 Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile 115 120 125 Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu 130 135 140 Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe 145 150 155 160 Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 165 170 175 Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys 180 185 190 Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys 195 200 205 Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 210 215 220 Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn 225 230 235 240 Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg 245 250 255 Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr 260 265 270 Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala 275 280 285 Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu 290 295 300 Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly 305 310 315 320 Leu Phe Gln Pro Tyr Asn Val Gly Arg Phe Asn Val Thr Ala Ala Val 325 330 335 Gly Gly Tyr Lys Ser Glu Ser Ala Val Ala Ile Gly Thr Gly Phe Arg 340 345 350 Phe Thr Glu Asn Phe Ala Ala Lys Ala Gly Val Ala Val Gly Thr Ser 355 360 365 Ser Gly Ser Ser Ala Ala Tyr His Val Gly Val Asn Tyr Glu Trp Gly 370 375 380 Ser Gly Gly Gly Gly Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln 385 390 395 400 Val Leu Asp Arg Gln His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe 405 410 415 Gly Ser Arg Gly Glu Leu Ala Glu Arg Ser Gly His Ile Gly Leu Gly 420 425 430 Lys Ile Gln Ser His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala 435 440 445 Ile Lys Gly Asn Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His 450 455 460 Glu Val His Ser Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp 465 470 475 480 Glu Ala Gly Ser Pro Val Asp Gly Phe 
Ser Leu Tyr Arg Ile His Trp 485 490 495 Asp Gly Tyr Glu His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly 500 505 510 Gly Gly Tyr Pro Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp 515 520 525 Ile Lys Gly Val Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg 530 535 540 Ser Thr Gly Gln Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met 545 550 555 560 Leu Thr Gln Gly Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser 565 570 575 Pro Glu Leu Asp Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr 580 585 590 Ala Asp Ile Val Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly 595 600 605 Ala Gly Asp Ala Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val 610 615 620 Met His Gly Leu Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile 625 630 635 640 Asn Asp Leu Ala Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala 645 650 655 Ile Arg Asp Trp Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu 660 665 670 Ala Val Ser Asn Ile Phe Met Ala Ala Ile Pro Ile Lys Gly Ile Gly 675 680 685 Ala Val Arg Gly Lys Tyr Gly Leu Gly Gly Ile Thr Ala His Pro Ile 690 695 700 Lys Arg Ser Gln Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala 705 710 715 720 Val Ser Asp Asn Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro 725 730 735 Tyr His Ser Arg Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys 740 745 750 Glu Asn Ile Thr Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val 755 760 765 Lys Leu Ala Asp Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly 770 775 780 Lys Gly Phe Pro Asn Phe Glu Lys His Val Lys Tyr Asp Thr Leu Glu 785 790 795 800 His His His His His His 805 43 1938 DNA Artificial Sequence 961-741 43 atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60 aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120 gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180 ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240 aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300 gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360 
ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420 aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480 aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540 gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600 gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660 gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720 aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780 agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 840 gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900 gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960 ctgttccaac cttacaacgt gggtcggttc aatgtaacgg ctgcagtcgg cggctacaaa 1020 tccgaatcgg cagtcgccat cggtaccggc ttccgcttta ccgaaaactt tgccgccaaa 1080 gcaggcgtgg cagtcggcac ttcgtccggt tcttccgcag cctaccatgt cggcgtcaat 1140 tacgagtggg gatccggagg gggtggtgtc gccgccgaca tcggtgcggg gcttgccgat 1200 gcactaaccg caccgctcga ccataaagac aaaggtttgc agtctttgac gctggatcag 1260 tccgtcagga aaaacgagaa actgaagctg gcggcacaag gtgcggaaaa aacttatgga 1320 aacggtgaca gcctcaatac gggcaaattg aagaacgaca aggtcagccg tttcgacttt 1380 atccgccaaa tcgaagtgga cgggcagctc attaccttgg agagtggaga gttccaagta 1440 tacaaacaaa gccattccgc cttaaccgcc tttcagaccg agcaaataca agattcggag 1500 cattccggga agatggttgc gaaacgccag ttcagaatcg gcgacatagc gggcgaacat 1560 acatcttttg acaagcttcc cgaaggcggc agggcgacat atcgcgggac ggcgttcggt 1620 tcagacgatg ccggcggaaa actgacctac accatagatt tcgccgccaa gcagggaaac 1680 ggcaaaatcg aacatttgaa atcgccagaa ctcaatgtcg acctggccgc cgccgatatc 1740 aagccggatg gaaaacgcca tgccgtcatc agcggttccg tcctttacaa ccaagccgag 1800 aaaggcagtt actccctcgg tatctttggc ggaaaagccc aggaagttgc cggcagcgcg 1860 gaagtgaaaa ccgtaaacgg catacgccat atcggccttg ccgccaagca actcgagcac 1920 caccaccacc accactga 1938 44 645 PRT Artificial Sequence 961-741 44 Met Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile 1 5 10 15 Ala Ala Ala Tyr Asn 
Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly 20 25 30 Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp 35 40 45 Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu 50 55 60 Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln 65 70 75 80 Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu 85 90 95 Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 100 105 110 Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile 115 120 125 Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu 130 135 140 Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe 145 150 155 160 Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 165 170 175 Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys 180 185 190 Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys 195 200 205 Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 210 215 220 Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn 225 230 235 240 Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg 245 250 255 Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr 260 265 270 Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala 275 280 285 Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu 290 295 300 Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly 305 310 315 320 Leu Phe Gln Pro Tyr Asn Val Gly Arg Phe Asn Val Thr Ala Ala Val 325 330 335 Gly Gly Tyr Lys Ser Glu Ser Ala Val Ala Ile Gly Thr Gly Phe Arg 340 345 350 Phe Thr Glu Asn Phe Ala Ala Lys Ala Gly Val Ala Val Gly Thr Ser 355 360 365 Ser Gly Ser Ser Ala Ala Tyr His Val Gly Val Asn Tyr Glu Trp Gly 370 375 380 Ser Gly Gly Gly Gly Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp 385 390 395 400 Ala Leu Thr Ala Pro Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu 405 410 415 Thr Leu Asp Gln Ser Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala 420 425 430 Gln Gly Ala Glu Lys Thr Tyr Gly Asn Gly 
Asp Ser Leu Asn Thr Gly 435 440 445 Lys Leu Lys Asn Asp Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile 450 455 460 Glu Val Asp Gly Gln Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val 465 470 475 480 Tyr Lys Gln Ser His Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile 485 490 495 Gln Asp Ser Glu His Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg 500 505 510 Ile Gly Asp Ile Ala Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu 515 520 525 Gly Gly Arg Ala Thr Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala 530 535 540 Gly Gly Lys Leu Thr Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn 545 550 555 560 Gly Lys Ile Glu His Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala 565 570 575 Ala Ala Asp Ile Lys Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly 580 585 590 Ser Val Leu Tyr Asn Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile 595 600 605 Phe Gly Gly Lys Ala Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr 610 615 620 Val Asn Gly Ile Arg His Ile Gly Leu Ala Ala Lys Gln Leu Glu His 625 630 635 640 His His His His His 645 45 4335 DNA Artificial Sequence 961-983 45 atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60 aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120 gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180 ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240 aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300 gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360 ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420 aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480 aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540 gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600 gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660 gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720 aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780 agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga 
cacacgcttg 840 gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900 gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960 ctgttccaac cttacaacgt gggtcggttc aatgtaacgg ctgcagtcgg cggctacaaa 1020 tccgaatcgg cagtcgccat cggtaccggc ttccgcttta ccgaaaactt tgccgccaaa 1080 gcaggcgtgg cagtcggcac ttcgtccggt tcttccgcag cctaccatgt cggcgtcaat 1140 tacgagtggg gatccggcgg aggcggcact tctgcgcccg acttcaatgc aggcggtacc 1200 ggtatcggca gcaacagcag agcaacaaca gcgaaatcag cagcagtatc ttacgccggt 1260 atcaagaacg aaatgtgcaa agacagaagc atgctctgtg ccggtcggga tgacgttgcg 1320 gttacagaca gggatgccaa aatcaatgcc ccccccccga atctgcatac cggagacttt 1380 ccaaacccaa atgacgcata caagaatttg atcaacctca aacctgcaat tgaagcaggc 1440 tatacaggac gcggggtaga ggtaggtatc gtcgacacag gcgaatccgt cggcagcata 1500 tcctttcccg aactgtatgg cagaaaagaa cacggctata acgaaaatta caaaaactat 1560 acggcgtata tgcggaagga agcgcctgaa gacggaggcg gtaaagacat tgaagcttct 1620 ttcgacgatg aggccgttat agagactgaa gcaaagccga cggatatccg ccacgtaaaa 1680 gaaatcggac acatcgattt ggtctcccat attattggcg ggcgttccgt ggacggcaga 1740 cctgcaggcg gtattgcgcc cgatgcgacg ctacacataa tgaatacgaa tgatgaaacc 1800 aagaacgaaa tgatggttgc agccatccgc aatgcatggg tcaagctggg cgaacgtggc 1860 gtgcgcatcg tcaataacag ttttggaaca acatcgaggg caggcactgc cgaccttttc 1920 caaatagcca attcggagga gcagtaccgc caagcgttgc tcgactattc cggcggtgat 1980 aaaacagacg agggtatccg cctgatgcaa cagagcgatt acggcaacct gtcctaccac 2040 atccgtaata aaaacatgct tttcatcttt tcgacaggca atgacgcaca agctcagccc 2100 aacacatatg ccctattgcc attttatgaa aaagacgctc aaaaaggcat tatcacagtc 2160 gcaggcgtag accgcagtgg agaaaagttc aaacgggaaa tgtatggaga accgggtaca 2220 gaaccgcttg agtatggctc caaccattgc ggaattactg ccatgtggtg cctgtcggca 2280 ccctatgaag caagcgtccg tttcacccgt acaaacccga ttcaaattgc cggaacatcc 2340 ttttccgcac ccatcgtaac cggcacggcg gctctgctgc tgcagaaata cccgtggatg 2400 agcaacgaca acctgcgtac cacgttgctg acgacggctc aggacatcgg tgcagtcggc 2460 gtggacagca agttcggctg gggactgctg gatgcgggta aggccatgaa cggacccgcg 
2520 tcctttccgt tcggcgactt taccgccgat acgaaaggta catccgatat tgcctactcc 2580 ttccgtaacg acatttcagg cacgggcggc ctgatcaaaa aaggcggcag ccaactgcaa 2640 ctgcacggca acaacaccta tacgggcaaa accattatcg aaggcggttc gctggtgttg 2700 tacggcaaca acaaatcgga tatgcgcgtc gaaaccaaag gtgcgctgat ttataacggg 2760 gcggcatccg gcggcagcct gaacagcgac ggcattgtct atctggcaga taccgaccaa 2820 tccggcgcaa acgaaaccgt acacatcaaa ggcagtctgc agctggacgg caaaggtacg 2880 ctgtacacac gtttgggcaa actgctgaaa gtggacggta cggcgattat cggcggcaag 2940 ctgtacatgt cggcacgcgg caagggggca ggctatctca acagtaccgg acgacgtgtt 3000 cccttcctga gtgccgccaa aatcgggcag gattattctt tcttcacaaa catcgaaacc 3060 gacggcggcc tgctggcttc cctcgacagc gtcgaaaaaa cagcgggcag tgaaggcgac 3120 acgctgtcct attatgtccg tcgcggcaat gcggcacgga ctgcttcggc agcggcacat 3180 tccgcgcccg ccggtctgaa acacgccgta gaacagggcg gcagcaatct ggaaaacctg 3240 atggtcgaac tggatgcctc cgaatcatcc gcaacacccg agacggttga aactgcggca 3300 gccgaccgca cagatatgcc gggcatccgc ccctacggcg caactttccg cgcagcggca 3360 gccgtacagc atgcgaatgc cgccgacggt gtacgcatct tcaacagtct cgccgctacc 3420 gtctatgccg acagtaccgc cgcccatgcc gatatgcagg gacgccgcct gaaagccgta 3480 tcggacgggt tggaccacaa cggcacgggt ctgcgcgtca tcgcgcaaac ccaacaggac 3540 ggtggaacgt gggaacaggg cggtgttgaa ggcaaaatgc gcggcagtac ccaaaccgtc 3600 ggcattgccg cgaaaaccgg cgaaaatacg acagcagccg ccacactggg catgggacgc 3660 agcacatgga gcgaaaacag tgcaaatgca aaaaccgaca gcattagtct gtttgcaggc 3720 atacggcacg atgcgggcga tatcggctat ctcaaaggcc tgttctccta cggacgctac 3780 aaaaacagca tcagccgcag caccggtgcg gacgaacatg cggaaggcag cgtcaacggc 3840 acgctgatgc agctgggcgc actgggcggt gtcaacgttc cgtttgccgc aacgggagat 3900 ttgacggtcg aaggcggtct gcgctacgac ctgctcaaac aggatgcatt cgccgaaaaa 3960 ggcagtgctt tgggctggag cggcaacagc ctcactgaag gcacgctggt cggactcgcg 4020 ggtctgaagc tgtcgcaacc cttgagcgat aaagccgtcc tgtttgcaac ggcgggcgtg 4080 gaacgcgacc tgaacggacg cgactacacg gtaacgggcg gctttaccgg cgcgactgca 4140 gcaaccggca agacgggggc acgcaatatg ccgcacaccc gtctggttgc cggcctgggc 4200 
gcggatgtcg aattcggcaa cggctggaac ggcttggcac gttacagcta cgccggttcc 4260 aaacagtacg gcaaccacag cggacgagtc ggcgtaggct accggttcct cgagcaccac 4320 caccaccacc actga 4335 46 1444 PRT Artificial Sequence 961-983 46 Met Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile 1 5 10 15 Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly 20 25 30 Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp 35 40 45 Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu 50 55 60 Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln 65 70 75 80 Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu 85 90 95 Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 100 105 110 Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile 115 120 125 Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu 130 135 140 Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe 145 150 155 160 Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 165 170 175 Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys 180 185 190 Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys 195 200 205 Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 210 215 220 Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn 225 230 235 240 Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg 245 250 255 Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr 260 265 270 Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala 275 280 285 Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu 290 295 300 Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly 305 310 315 320 Leu Phe Gln Pro Tyr Asn Val Gly Arg Phe Asn Val Thr Ala Ala Val 325 330 335 Gly Gly Tyr Lys Ser Glu Ser Ala Val Ala Ile Gly Thr Gly Phe Arg 340 345 350 Phe Thr Glu Asn Phe Ala Ala Lys Ala Gly Val Ala Val Gly Thr Ser 355 360 365 Ser Gly Ser Ser Ala Ala Tyr His Val Gly Val 
Asn Tyr Glu Trp Gly 370 375 380 Ser Gly Gly Gly Gly Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr 385 390 395 400 Gly Ile Gly Ser Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val 405 410 415 Ser Tyr Ala Gly Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu 420 425 430 Cys Ala Gly Arg Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile 435 440 445 Asn Ala Pro Pro Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn 450 455 460 Asp Ala Tyr Lys Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly 465 470 475 480 Tyr Thr Gly Arg Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser 485 490 495 Val Gly Ser Ile Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly 500 505 510 Tyr Asn Glu Asn Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala 515 520 525 Pro Glu Asp Gly Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu 530 535 540 Ala Val Ile Glu Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys 545 550 555 560 Glu Ile Gly His Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser 565 570 575 Val Asp Gly Arg Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His 580 585 590 Ile Met Asn Thr Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala 595 600 605 Ile Arg Asn Ala Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val 610 615 620 Asn Asn Ser Phe Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe 625 630 635 640 Gln Ile Ala Asn Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr 645 650 655 Ser Gly Gly Asp Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser 660 665 670 Asp Tyr Gly Asn Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe 675 680 685 Ile Phe Ser Thr Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala 690 695 700 Leu Leu Pro Phe Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val 705 710 715 720 Ala Gly Val Asp Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly 725 730 735 Glu Pro Gly Thr Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile 740 745 750 Thr Ala Met Trp Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe 755 760 765 Thr Arg Thr Asn Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro 770 775 780 Ile Val Thr Gly Thr Ala Ala Leu Leu Leu Gln Lys 
Tyr Pro Trp Met 785 790 795 800 Ser Asn Asp Asn Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile 805 810 815 Gly Ala Val Gly Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala 820 825 830 Gly Lys Ala Met Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr 835 840 845 Ala Asp Thr Lys Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp 850 855 860 Ile Ser Gly Thr Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln 865 870 875 880 Leu His Gly Asn Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly 885 890 895 Ser Leu Val Leu Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr 900 905 910 Lys Gly Ala Leu Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn 915 920 925 Ser Asp Gly Ile Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn 930 935 940 Glu Thr Val His Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr 945 950 955 960 Leu Tyr Thr Arg Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile 965 970 975 Ile Gly Gly Lys Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr 980 985 990 Leu Asn Ser Thr Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile 995 1000 1005 Gly Gln Asp Tyr Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu 1010 1015 1020 Leu Ala Ser Leu Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp 1025 1030 1035 1040 Thr Leu Ser Tyr Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser 1045 1050 1055 Ala Ala Ala His Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln 1060 1065 1070 Gly Gly Ser Asn Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu 1075 1080 1085 Ser Ser Ala Thr Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr 1090 1095 1100 Asp Met Pro Gly Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala 1105 1110 1115 1120 Ala Val Gln His Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser 1125 1130 1135 Leu Ala Ala Thr Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met 1140 1145 1150 Gln Gly Arg Arg Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly 1155 1160 1165 Thr Gly Leu Arg Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp 1170 1175 1180 Glu Gln Gly Gly Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val 1185 1190 1195 1200 Gly 
Ile Ala Ala Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu 1205 1210 1215 Gly Met Gly Arg Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr 1220 1225 1230 Asp Ser Ile Ser Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile 1235 1240 1245 Gly Tyr Leu Lys Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile 1250 1255 1260 Ser Arg Ser Thr Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly 1265 1270 1275 1280 Thr Leu Met Gln Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala 1285 1290 1295 Ala Thr Gly Asp Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu 1300 1305 1310 Lys Gln Asp Ala Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly 1315 1320 1325 Asn Ser Leu Thr Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu 1330 1335 1340 Ser Gln Pro Leu Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val 1345 1350 1355 1360 Glu Arg Asp Leu Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr 1365 1370 1375 Gly Ala Thr Ala Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His 1380 1385 1390 Thr Arg Leu Val Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly 1395 1400 1405 Trp Asn Gly Leu Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly 1410 1415 1420 Asn His Ser Gly Arg Val Gly Val Gly Tyr Arg Phe Leu Glu His His 1425 1430 1435 1440 His His His His 47 2256 DNA Artificial Sequence 961c-ORF46.1 47 atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60 aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120 gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180 ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240 aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300 gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360 ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420 aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480 aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540 gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600 gctgcagaaa ctgcagcagg caaagccgaa 
gctgccgctg gcacagctaa tactgcagcc 660 gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720 aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780 agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 840 gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900 gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960 ctgttccaac cttacaacgt gggtggatcc ggaggaggag gatcagattt ggcaaacgat 1020 tcttttatcc ggcaggttct cgaccgtcag catttcgaac ccgacgggaa ataccaccta 1080 ttcggcagca ggggggaact tgccgagcgc agcggccata tcggattggg aaaaatacaa 1140 agccatcagt tgggcaacct gatgattcaa caggcggcca ttaaaggaaa tatcggctac 1200 attgtccgct tttccgatca cgggcacgaa gtccattccc ccttcgacaa ccatgcctca 1260 cattccgatt ctgatgaagc cggtagtccc gttgacggat ttagccttta ccgcatccat 1320 tgggacggat acgaacacca tcccgccgac ggctatgacg ggccacaggg cggcggctat 1380 cccgctccca aaggcgcgag ggatatatac agctacgaca taaaaggcgt tgcccaaaat 1440 atccgcctca acctgaccga caaccgcagc accggacaac ggcttgccga ccgtttccac 1500 aatgccggta gtatgctgac gcaaggagta ggcgacggat tcaaacgcgc cacccgatac 1560 agccccgagc tggacagatc gggcaatgcc gccgaagcct tcaacggcac tgcagatatc 1620 gttaaaaaca tcatcggcgc ggcaggagaa attgtcggcg caggcgatgc cgtgcagggc 1680 ataagcgaag gctcaaacat tgctgtcatg cacggcttgg gtctgctttc caccgaaaac 1740 aagatggcgc gcatcaacga tttggcagat atggcgcaac tcaaagacta tgccgcagca 1800 gccatccgcg attgggcagt ccaaaacccc aatgccgcac aaggcataga agccgtcagc 1860 aatatcttta tggcagccat ccccatcaaa gggattggag ctgttcgggg aaaatacggc 1920 ttgggcggca tcacggcaca tcctatcaag cggtcgcaga tgggcgcgat cgcattgccg 1980 aaagggaaat ccgccgtcag cgacaatttt gccgatgcgg catacgccaa atacccgtcc 2040 ccttaccatt cccgaaatat ccgttcaaac ttggagcagc gttacggcaa agaaaacatc 2100 acctcctcaa ccgtgccgcc gtcaaacggc aaaaatgtca aactggcaga ccaacgccac 2160 ccgaagacag gcgtaccgtt tgacggtaaa gggtttccga attttgagaa gcacgtgaaa 2220 tatgatacgc tcgagcacca ccaccaccac cactga 2256 48 751 PRT Artificial Sequence 961c-ORF46.1 48 Met Ala Thr Asn Asp Asp Asp 
Val Lys Lys Ala Ala Thr Val Ala Ile 1 5 10 15 Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly 20 25 30 Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp 35 40 45 Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu 50 55 60 Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln 65 70 75 80 Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu 85 90 95 Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 100 105 110 Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile 115 120 125 Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu 130 135 140 Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe 145 150 155 160 Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 165 170 175 Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys 180 185 190 Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys 195 200 205 Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 210 215 220 Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn 225 230 235 240 Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg 245 250 255 Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr 260 265 270 Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala 275 280 285 Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu 290 295 300 Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly 305 310 315 320 Leu Phe Gln Pro Tyr Asn Val Gly Gly Ser Gly Gly Gly Gly Ser Asp 325 330 335 Leu Ala Asn Asp Ser Phe Ile Arg Gln Val Leu Asp Arg Gln His Phe 340 345 350 Glu Pro Asp Gly Lys Tyr His Leu Phe Gly Ser Arg Gly Glu Leu Ala 355 360 365 Glu Arg Ser Gly His Ile Gly Leu Gly Lys Ile Gln Ser His Gln Leu 370 375 380 Gly Asn Leu Met Ile Gln Gln Ala Ala Ile Lys Gly Asn Ile Gly Tyr 385 390 395 400 Ile Val Arg Phe Ser Asp His Gly His Glu Val His Ser Pro Phe Asp 405 410 415 Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly 
Ser Pro Val Asp 420 425 430 Gly Phe Ser Leu Tyr Arg Ile His Trp Asp Gly Tyr Glu His His Pro 435 440 445 Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly Gly Tyr Pro Ala Pro Lys 450 455 460 Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile Lys Gly Val Ala Gln Asn 465 470 475 480 Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly Gln Arg Leu Ala 485 490 495 Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gln Gly Val Gly Asp 500 505 510 Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro Glu Leu Asp Arg Ser Gly 515 520 525 Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp Ile Val Lys Asn Ile 530 535 540 Ile Gly Ala Ala Gly Glu Ile Val Gly Ala Gly Asp Ala Val Gln Gly 545 550 555 560 Ile Ser Glu Gly Ser Asn Ile Ala Val Met His Gly Leu Gly Leu Leu 565 570 575 Ser Thr Glu Asn Lys Met Ala Arg Ile Asn Asp Leu Ala Asp Met Ala 580 585 590 Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile Arg Asp Trp Ala Val Gln 595 600 605 Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala Val Ser Asn Ile Phe Met 610 615 620 Ala Ala Ile Pro Ile Lys Gly Ile Gly Ala Val Arg Gly Lys Tyr Gly 625 630 635 640 Leu Gly Gly Ile Thr Ala His Pro Ile Lys Arg Ser Gln Met Gly Ala 645 650 655 Ile Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp Asn Phe Ala Asp 660 665 670 Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser Arg Asn Ile Arg 675 680 685 Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu Asn Ile Thr Ser Ser Thr 690 695 700 Val Pro Pro Ser Asn Gly Lys Asn Val Lys Leu Ala Asp Gln Arg His 705 710 715 720 Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe Pro Asn Phe Glu 725 730 735 Lys His Val Lys Tyr Asp Thr Leu Glu His His His His His His 740 745 750 49 1773 DNA Artificial Sequence 961c-741 49 atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60 aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120 gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180 ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240 aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300 gcagacactg atgccgcttt agcagatact gatgccgctc 
tggatgcaac caccaacgcc 360 ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420 aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480 aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540 gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600 gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660 gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720 aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780 agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 840 gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900 gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960 ctgttccaac cttacaacgt gggtggatcc ggagggggtg gtgtcgccgc cgacatcggt 1020 gcggggcttg ccgatgcact aaccgcaccg ctcgaccata aagacaaagg tttgcagtct 1080 ttgacgctgg atcagtccgt caggaaaaac gagaaactga agctggcggc acaaggtgcg 1140 gaaaaaactt atggaaacgg tgacagcctc aatacgggca aattgaagaa cgacaaggtc 1200 agccgtttcg actttatccg ccaaatcgaa gtggacgggc agctcattac cttggagagt 1260 ggagagttcc aagtatacaa acaaagccat tccgccttaa ccgcctttca gaccgagcaa 1320 atacaagatt cggagcattc cgggaagatg gttgcgaaac gccagttcag aatcggcgac 1380 atagcgggcg aacatacatc ttttgacaag cttcccgaag gcggcagggc gacatatcgc 1440 gggacggcgt tcggttcaga cgatgccggc ggaaaactga cctacaccat agatttcgcc 1500 gccaagcagg gaaacggcaa aatcgaacat ttgaaatcgc cagaactcaa tgtcgacctg 1560 gccgccgccg atatcaagcc ggatggaaaa cgccatgccg tcatcagcgg ttccgtcctt 1620 tacaaccaag ccgagaaagg cagttactcc ctcggtatct ttggcggaaa agcccaggaa 1680 gttgccggca gcgcggaagt gaaaaccgta aacggcatac gccatatcgg ccttgccgcc 1740 aagcaactcg agcaccacca ccaccaccac tga 1773 50 590 PRT Artificial Sequence 961c-741 50 Met Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile 1 5 10 15 Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly 20 25 30 Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp 35 40 45 Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe 
Lys Gly Leu Gly Leu 50 55 60 Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln 65 70 75 80 Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu 85 90 95 Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 100 105 110 Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile 115 120 125 Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu 130 135 140 Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe 145 150 155 160 Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 165 170 175 Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys 180 185 190 Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys 195 200 205 Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 210 215 220 Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn 225 230 235 240 Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg 245 250 255 Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr 260 265 270 Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala 275 280 285 Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu 290 295 300 Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly 305 310 315 320 Leu Phe Gln Pro Tyr Asn Val Gly Gly Ser Gly Gly Gly Gly Val Ala 325 330 335 Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala Pro Leu Asp 340 345 350 His Lys Asp Lys Gly Leu Gln Ser Leu Thr Leu Asp Gln Ser Val Arg 355 360 365 Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln Gly Ala Glu Lys Thr Tyr 370 375 380 Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn Asp Lys Val 385 390 395 400 Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu Val Asp Gly Gln Leu Ile 405 410 415 Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr Lys Gln Ser His Ser Ala 420 425 430 Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln Asp Ser Glu His Ser Gly 435 440 445 Lys Met Val Ala Lys Arg Gln Phe Arg Ile Gly Asp Ile Ala Gly Glu 450 455 460 His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr 
Tyr Arg 465 470 475 480 Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr Tyr Thr 485 490 495 Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly Lys Ile Glu His Leu Lys 500 505 510 Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp Ile Lys Pro Asp 515 520 525 Gly Lys Arg His Ala Val Ile Ser Gly Ser Val Leu Tyr Asn Gln Ala 530 535 540 Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe Gly Gly Lys Ala Gln Glu 545 550 555 560 Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly Ile Arg His Ile 565 570 575 Gly Leu Ala Ala Lys Gln Leu Glu His His His His His His 580 585 590 51 4170 DNA Artificial Sequence 961c-983 51 atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60 aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120 gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180 ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240 aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300 gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360 ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420 aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480 aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540 gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600 gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660 gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720 aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780 agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 840 gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900 gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960 ctgttccaac cttacaacgt gggtggatcc ggcggaggcg gcacttctgc gcccgacttc 1020 aatgcaggcg gtaccggtat cggcagcaac agcagagcaa caacagcgaa atcagcagca 1080 gtatcttacg ccggtatcaa gaacgaaatg tgcaaagaca gaagcatgct ctgtgccggt 1140 cgggatgacg ttgcggttac agacagggat gccaaaatca atgccccccc cccgaatctg 1200 
cataccggag actttccaaa cccaaatgac gcatacaaga atttgatcaa cctcaaacct 1260 gcaattgaag caggctatac aggacgcggg gtagaggtag gtatcgtcga cacaggcgaa 1320 tccgtcggca gcatatcctt tcccgaactg tatggcagaa aagaacacgg ctataacgaa 1380 aattacaaaa actatacggc gtatatgcgg aaggaagcgc ctgaagacgg aggcggtaaa 1440 gacattgaag cttctttcga cgatgaggcc gttatagaga ctgaagcaaa gccgacggat 1500 atccgccacg taaaagaaat cggacacatc gatttggtct cccatattat tggcgggcgt 1560 tccgtggacg gcagacctgc aggcggtatt gcgcccgatg cgacgctaca cataatgaat 1620 acgaatgatg aaaccaagaa cgaaatgatg gttgcagcca tccgcaatgc atgggtcaag 1680 ctgggcgaac gtggcgtgcg catcgtcaat aacagttttg gaacaacatc gagggcaggc 1740 actgccgacc ttttccaaat agccaattcg gaggagcagt accgccaagc gttgctcgac 1800 tattccggcg gtgataaaac agacgagggt atccgcctga tgcaacagag cgattacggc 1860 aacctgtcct accacatccg taataaaaac atgcttttca tcttttcgac aggcaatgac 1920 gcacaagctc agcccaacac atatgcccta ttgccatttt atgaaaaaga cgctcaaaaa 1980 ggcattatca cagtcgcagg cgtagaccgc agtggagaaa agttcaaacg ggaaatgtat 2040 ggagaaccgg gtacagaacc gcttgagtat ggctccaacc attgcggaat tactgccatg 2100 tggtgcctgt cggcacccta tgaagcaagc gtccgtttca cccgtacaaa cccgattcaa 2160 attgccggaa catccttttc cgcacccatc gtaaccggca cggcggctct gctgctgcag 2220 aaatacccgt ggatgagcaa cgacaacctg cgtaccacgt tgctgacgac ggctcaggac 2280 atcggtgcag tcggcgtgga cagcaagttc ggctggggac tgctggatgc gggtaaggcc 2340 atgaacggac ccgcgtcctt tccgttcggc gactttaccg ccgatacgaa aggtacatcc 2400 gatattgcct actccttccg taacgacatt tcaggcacgg gcggcctgat caaaaaaggc 2460 ggcagccaac tgcaactgca cggcaacaac acctatacgg gcaaaaccat tatcgaaggc 2520 ggttcgctgg tgttgtacgg caacaacaaa tcggatatgc gcgtcgaaac caaaggtgcg 2580 ctgatttata acggggcggc atccggcggc agcctgaaca gcgacggcat tgtctatctg 2640 gcagataccg accaatccgg cgcaaacgaa accgtacaca tcaaaggcag tctgcagctg 2700 gacggcaaag gtacgctgta cacacgtttg ggcaaactgc tgaaagtgga cggtacggcg 2760 attatcggcg gcaagctgta catgtcggca cgcggcaagg gggcaggcta tctcaacagt 2820 accggacgac gtgttccctt cctgagtgcc gccaaaatcg ggcaggatta ttctttcttc 2880 acaaacatcg 
aaaccgacgg cggcctgctg gcttccctcg acagcgtcga aaaaacagcg 2940 ggcagtgaag gcgacacgct gtcctattat gtccgtcgcg gcaatgcggc acggactgct 3000 tcggcagcgg cacattccgc gcccgccggt ctgaaacacg ccgtagaaca gggcggcagc 3060 aatctggaaa acctgatggt cgaactggat gcctccgaat catccgcaac acccgagacg 3120 gttgaaactg cggcagccga ccgcacagat atgccgggca tccgccccta cggcgcaact 3180 ttccgcgcag cggcagccgt acagcatgcg aatgccgccg acggtgtacg catcttcaac 3240 agtctcgccg ctaccgtcta tgccgacagt accgccgccc atgccgatat gcagggacgc 3300 cgcctgaaag ccgtatcgga cgggttggac cacaacggca cgggtctgcg cgtcatcgcg 3360 caaacccaac aggacggtgg aacgtgggaa cagggcggtg ttgaaggcaa aatgcgcggc 3420 agtacccaaa ccgtcggcat tgccgcgaaa accggcgaaa atacgacagc agccgccaca 3480 ctgggcatgg gacgcagcac atggagcgaa aacagtgcaa atgcaaaaac cgacagcatt 3540 agtctgtttg caggcatacg gcacgatgcg ggcgatatcg gctatctcaa aggcctgttc 3600 tcctacggac gctacaaaaa cagcatcagc cgcagcaccg gtgcggacga acatgcggaa 3660 ggcagcgtca acggcacgct gatgcagctg ggcgcactgg gcggtgtcaa cgttccgttt 3720 gccgcaacgg gagatttgac ggtcgaaggc ggtctgcgct acgacctgct caaacaggat 3780 gcattcgccg aaaaaggcag tgctttgggc tggagcggca acagcctcac tgaaggcacg 3840 ctggtcggac tcgcgggtct gaagctgtcg caacccttga gcgataaagc cgtcctgttt 3900 gcaacggcgg gcgtggaacg cgacctgaac ggacgcgact acacggtaac gggcggcttt 3960 accggcgcga ctgcagcaac cggcaagacg ggggcacgca atatgccgca cacccgtctg 4020 gttgccggcc tgggcgcgga tgtcgaattc ggcaacggct ggaacggctt ggcacgttac 4080 agctacgccg gttccaaaca gtacggcaac cacagcggac gagtcggcgt aggctaccgg 4140 ttcctcgagc accaccacca ccaccactga 4170 52 1389 PRT Artificial Sequence 961c-983 52 Met Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala Ile 1 5 10 15 Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile Asn Gly Phe Lys Ala Gly 20 25 30 Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly Thr Ile Thr Lys Lys Asp 35 40 45 Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu 50 55 60 Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gln 65 70 75 80 Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu Ile Glu Lys Leu 85 
90 95 Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 100 105 110 Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn Ile 115 120 125 Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn Ile Val Lys Ile Asp Glu 130 135 140 Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe 145 150 155 160 Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 165 170 175 Ala Val Lys Thr Ala Asn Glu Ala Lys Gln Thr Ala Glu Glu Thr Lys 180 185 190 Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys 195 200 205 Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 210 215 220 Ala Val Ala Ala Lys Val Thr Asp Ile Lys Ala Asp Ile Ala Thr Asn 225 230 235 240 Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg 245 250 255 Glu Glu Ser Asp Ser Lys Phe Val Arg Ile Asp Gly Leu Asn Ala Thr 260 265 270 Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser Ile Ala 275 280 285 Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu 290 295 300 Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu Gln Ala Ala Leu Ser Gly 305 310 315 320 Leu Phe Gln Pro Tyr Asn Val Gly Gly Ser Gly Gly Gly Gly Thr Ser 325 330 335 Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly Ile Gly Ser Asn Ser Arg 340 345 350 Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly Ile Lys Asn 355 360 365 Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg Asp Asp Val 370 375 380 Ala Val Thr Asp Arg Asp Ala Lys Ile Asn Ala Pro Pro Pro Asn Leu 385 390 395 400 His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys Asn Leu Ile 405 410 415 Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr Thr Gly Arg Gly Val Glu 420 425 430 Val Gly Ile Val Asp Thr Gly Glu Ser Val Gly Ser Ile Ser Phe Pro 435 440 445 Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn Tyr Lys Asn 450 455 460 Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly Gly Gly Lys 465 470 475 480 Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala Val Ile Glu Thr Glu Ala 485 490 495 Lys Pro Thr Asp Ile Arg His Val Lys Glu Ile Gly His Ile Asp Leu 500 505 
510 Val Ser His Ile Ile Gly Gly Arg Ser Val Asp Gly Arg Pro Ala Gly 515 520 525 Gly Ile Ala Pro Asp Ala Thr Leu His Ile Met Asn Thr Asn Asp Glu 530 535 540 Thr Lys Asn Glu Met Met Val Ala Ala Ile Arg Asn Ala Trp Val Lys 545 550 555 560 Leu Gly Glu Arg Gly Val Arg Ile Val Asn Asn Ser Phe Gly Thr Thr 565 570 575 Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln Ile Ala Asn Ser Glu Glu 580 585 590 Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser Gly Gly Asp Lys Thr Asp 595 600 605 Glu Gly Ile Arg Leu Met Gln Gln Ser Asp Tyr Gly Asn Leu Ser Tyr 610 615 620 His Ile Arg Asn Lys Asn Met Leu Phe Ile Phe Ser Thr Gly Asn Asp 625 630 635 640 Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu Leu Pro Phe Tyr Glu Lys 645 650 655 Asp Ala Gln Lys Gly Ile Ile Thr Val Ala Gly Val Asp Arg Ser Gly 660 665 670 Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr Glu Pro Leu 675 680 685 Glu Tyr Gly Ser Asn His Cys Gly Ile Thr Ala Met Trp Cys Leu Ser 690 695 700 Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn Pro Ile Gln 705 710 715 720 Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile Val Thr Gly Thr Ala Ala 725 730 735 Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser Asn Asp Asn Leu Arg Thr 740 745 750 Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly Ala Val Gly Val Asp Ser 755 760 765 Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met Asn Gly Pro 770 775 780 Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys Gly Thr Ser 785 790 795 800 Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile Ser Gly Thr Gly Gly Leu 805 810 815 Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu His Gly Asn Asn Thr Tyr 820 825 830 Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser Leu Val Leu Tyr Gly Asn 835 840 845 Asn Lys Ser Asp Met Arg Val Glu Thr Lys Gly Ala Leu Ile Tyr Asn 850 855 860 Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly Ile Val Tyr Leu 865 870 875 880 Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu Thr Val His Ile Lys Gly 885 890 895 Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg Leu Gly Lys 900 905 910 Leu Leu Lys Val Asp Gly Thr Ala Ile Ile Gly Gly Lys Leu Tyr Met 915 920 925 
Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr Gly Arg Arg 930 935 940 Val Pro Phe Leu Ser Ala Ala Lys Ile Gly Gln Asp Tyr Ser Phe Phe 945 950 955 960 Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu Asp Ser Val 965 970 975 Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr Tyr Val Arg 980 985 990 Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His Ser Ala Pro 995 1000 1005 Ala Gly Leu Lys His Ala Val Glu Gln Gly Gly Ser Asn Leu Glu Asn 1010 1015 1020 Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr Pro Glu Thr 1025 1030 1035 1040 Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly Ile Arg Pro 1045 1050 1055 Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gln His Ala Asn Ala 1060 1065 1070 Ala Asp Gly Val Arg Ile Phe Asn Ser Leu Ala Ala Thr Val Tyr Ala 1075 1080 1085 Asp Ser Thr Ala Ala His Ala Asp Met Gln Gly Arg Arg Leu Lys Ala 1090 1095 1100 Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg Val Ile Ala 1105 1110 1115 1120 Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu Gln Gly Gly Val Glu Gly 1125 1130 1135 Lys Met Arg Gly Ser Thr Gln Thr Val Gly Ile Ala Ala Lys Thr Gly 1140 1145 1150 Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg Ser Thr Trp 1155 1160 1165 Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser Ile Ser Leu Phe Ala 1170 1175 1180 Gly Ile Arg His Asp Ala Gly Asp Ile Gly Tyr Leu Lys Gly Leu Phe 1185 1190 1195 1200 Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser Arg Ser Thr Gly Ala Asp 1205 1210 1215 Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gln Leu Gly Ala 1220 1225 1230 Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp Leu Thr Val 1235 1240 1245 Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gln Asp Ala Phe Ala Glu 1250 1255 1260 Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr Glu Gly Thr 1265 1270 1275 1280 Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gln Pro Leu Ser Asp Lys 1285 1290 1295 Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg Asp Leu Asn Gly Arg 1300 1305 1310 Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala Ala Thr Gly 1315 1320 1325 Lys Thr Gly 
Ala Arg Asn Met Pro His Thr Arg Leu Val Ala Gly Leu 1330 1335 1340 Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu Ala Arg Tyr 1345 1350 1355 1360 Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn His Ser Gly Arg Val Gly 1365 1370 1375 Val Gly Tyr Arg Phe Leu Glu His His His His His His 1380 1385 53 2304 DNA Artificial Sequence 961cL-ORF46.1 53 atgaaacact ttccatccaa agtactgacc acagccatcc ttgccacttt ctgtagcggc 60 gcactggcag ccacaaacga cgacgatgtt aaaaaagctg ccactgtggc cattgctgct 120 gcctacaaca atggccaaga aatcaacggt ttcaaagctg gagagaccat ctacgacatt 180 gatgaagacg gcacaattac caaaaaagac gcaactgcag ccgatgttga agccgacgac 240 tttaaaggtc tgggtctgaa aaaagtcgtg actaacctga ccaaaaccgt caatgaaaac 300 aaacaaaacg tcgatgccaa agtaaaagct gcagaatctg aaatagaaaa gttaacaacc 360 aagttagcag acactgatgc cgctttagca gatactgatg ccgctctgga tgcaaccacc 420 aacgccttga ataaattggg agaaaatata acgacatttg ctgaagagac taagacaaat 480 atcgtaaaaa ttgatgaaaa attagaagcc gtggctgata ccgtcgacaa gcatgccgaa 540 gcattcaacg atatcgccga ttcattggat gaaaccaaca ctaaggcaga cgaagccgtc 600 aaaaccgcca atgaagccaa acagacggcc gaagaaacca aacaaaacgt cgatgccaaa 660 gtaaaagctg cagaaactgc agcaggcaaa gccgaagctg ccgctggcac agctaatact 720 gcagccgaca aggccgaagc tgtcgctgca aaagttaccg acatcaaagc tgatatcgct 780 acgaacaaag ataatattgc taaaaaagca aacagtgccg acgtgtacac cagagaagag 840 tctgacagca aatttgtcag aattgatggt ctgaacgcta ctaccgaaaa attggacaca 900 cgcttggctt ctgctgaaaa atccattgcc gatcacgata ctcgcctgaa cggtttggat 960 aaaacagtgt cagacctgcg caaagaaacc cgccaaggcc ttgcagaaca agccgcgctc 1020 tccggtctgt tccaacctta caacgtgggt ggatccggag gaggaggatc agatttggca 1080 aacgattctt ttatccggca ggttctcgac cgtcagcatt tcgaacccga cgggaaatac 1140 cacctattcg gcagcagggg ggaacttgcc gagcgcagcg gccatatcgg attgggaaaa 1200 atacaaagcc atcagttggg caacctgatg attcaacagg cggccattaa aggaaatatc 1260 ggctacattg tccgcttttc cgatcacggg cacgaagtcc attccccctt cgacaaccat 1320 gcctcacatt ccgattctga tgaagccggt agtcccgttg acggatttag cctttaccgc 1380 atccattggg acggatacga acaccatccc 
gccgacggct atgacgggcc acagggcggc 1440 ggctatcccg ctcccaaagg cgcgagggat atatacagct acgacataaa aggcgttgcc 1500 caaaatatcc gcctcaacct gaccgacaac cgcagcaccg gacaacggct tgccgaccgt 1560 ttccacaatg ccggtagtat gctgacgcaa ggagtaggcg acggattcaa acgcgccacc 1620 cgatacagcc ccgagctgga cagatcgggc aatgccgccg aagccttcaa cggcactgca 1680 gatatcgtta aaaacatcat cggcgcggca ggagaaattg tcggcgcagg cgatgccgtg 1740 cagggcataa gcgaaggctc aaacattgct gtcatgcacg gcttgggtct gctttccacc 1800 gaaaacaaga tggcgcgcat caacgatttg gcagatatgg cgcaactcaa agactatgcc 1860 gcagcagcca tccgcgattg ggcagtccaa aaccccaatg ccgcacaagg catagaagcc 1920 gtcagcaata tctttatggc agccatcccc atcaaaggga ttggagctgt tcggggaaaa 1980 tacggcttgg gcggcatcac ggcacatcct atcaagcggt cgcagatggg cgcgatcgca 2040 ttgccgaaag ggaaatccgc cgtcagcgac aattttgccg atgcggcata cgccaaatac 2100 ccgtcccctt accattcccg aaatatccgt tcaaacttgg agcagcgtta cggcaaagaa 2160 aacatcacct cctcaaccgt gccgccgtca aacggcaaaa atgtcaaact ggcagaccaa 2220 cgccacccga agacaggcgt accgtttgac ggtaaagggt ttccgaattt tgagaagcac 2280 gtgaaatatg atacgtaact cgag 2304 54 765 PRT Artificial Sequence 961cL-ORF46.1 54 Met Lys His Phe Pro Ser Lys Val Leu Thr Thr Ala Ile Leu Ala Thr 1 5 10 15 Phe Cys Ser Gly Ala Leu Ala Ala Thr Asn Asp Asp Asp Val Lys Lys 20 25 30 Ala Ala Thr Val Ala Ile Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile 35 40 45 Asn Gly Phe Lys Ala Gly Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly 50 55 60 Thr Ile Thr Lys Lys Asp Ala Thr Ala Ala Asp Val Glu Ala Asp Asp 65 70 75 80 Phe Lys Gly Leu Gly Leu Lys Lys Val Val Thr Asn Leu Thr Lys Thr 85 90 95 Val Asn Glu Asn Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu 100 105 110 Ser Glu Ile Glu Lys Leu Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala 115 120 125 Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn 130 135 140 Lys Leu Gly Glu Asn Ile Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn 145 150 155 160 Ile Val Lys Ile Asp Glu Lys Leu Glu Ala Val Ala Asp Thr Val Asp 165 170 175 Lys His Ala Glu Ala Phe Asn Asp Ile Ala Asp Ser Leu Asp Glu 
Thr 180 185 190 Asn Thr Lys Ala Asp Glu Ala Val Lys Thr Ala Asn Glu Ala Lys Gln 195 200 205 Thr Ala Glu Glu Thr Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala 210 215 220 Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr 225 230 235 240 Ala Ala Asp Lys Ala Glu Ala Val Ala Ala Lys Val Thr Asp Ile Lys 245 250 255 Ala Asp Ile Ala Thr Asn Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser 260 265 270 Ala Asp Val Tyr Thr Arg Glu Glu Ser Asp Ser Lys Phe Val Arg Ile 275 280 285 Asp Gly Leu Asn Ala Thr Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser 290 295 300 Ala Glu Lys Ser Ile Ala Asp His Asp Thr Arg Leu Asn Gly Leu Asp 305 310 315 320 Lys Thr Val Ser Asp Leu Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu 325 330 335 Gln Ala Ala Leu Ser Gly Leu Phe Gln Pro Tyr Asn Val Gly Gly Ser 340 345 350 Gly Gly Gly Gly Ser Asp Leu Ala Asn Asp Ser Phe Ile Arg Gln Val 355 360 365 Leu Asp Arg Gln His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly 370 375 380 Ser Arg Gly Glu Leu Ala Glu Arg Ser Gly His Ile Gly Leu Gly Lys 385 390 395 400 Ile Gln Ser His Gln Leu Gly Asn Leu Met Ile Gln Gln Ala Ala Ile 405 410 415 Lys Gly Asn Ile Gly Tyr Ile Val Arg Phe Ser Asp His Gly His Glu 420 425 430 Val His Ser Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu 435 440 445 Ala Gly Ser Pro Val Asp Gly Phe Ser Leu Tyr Arg Ile His Trp Asp 450 455 460 Gly Tyr Glu His His Pro Ala Asp Gly Tyr Asp Gly Pro Gln Gly Gly 465 470 475 480 Gly Tyr Pro Ala Pro Lys Gly Ala Arg Asp Ile Tyr Ser Tyr Asp Ile 485 490 495 Lys Gly Val Ala Gln Asn Ile Arg Leu Asn Leu Thr Asp Asn Arg Ser 500 505 510 Thr Gly Gln Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu 515 520 525 Thr Gln Gly Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro 530 535 540 Glu Leu Asp Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala 545 550 555 560 Asp Ile Val Lys Asn Ile Ile Gly Ala Ala Gly Glu Ile Val Gly Ala 565 570 575 Gly Asp Ala Val Gln Gly Ile Ser Glu Gly Ser Asn Ile Ala Val Met 580 585 590 His Gly Leu Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg Ile Asn 
595 600 605 Asp Leu Ala Asp Met Ala Gln Leu Lys Asp Tyr Ala Ala Ala Ala Ile 610 615 620 Arg Asp Trp Ala Val Gln Asn Pro Asn Ala Ala Gln Gly Ile Glu Ala 625 630 635 640 Val Ser Asn Ile Phe Met Ala Ala Ile Pro Ile Lys Gly Ile Gly Ala 645 650 655 Val Arg Gly Lys Tyr Gly Leu Gly Gly Ile Thr Ala His Pro Ile Lys 660 665 670 Arg Ser Gln Met Gly Ala Ile Ala Leu Pro Lys Gly Lys Ser Ala Val 675 680 685 Ser Asp Asn Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr 690 695 700 His Ser Arg Asn Ile Arg Ser Asn Leu Glu Gln Arg Tyr Gly Lys Glu 705 710 715 720 Asn Ile Thr Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val Lys 725 730 735 Leu Ala Asp Gln Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys 740 745 750 Gly Phe Pro Asn Phe Glu Lys His Val Lys Tyr Asp Thr 755 760 765 55 1839 DNA Artificial Sequence 961cL-741 55 atgaaacact ttccatccaa agtactgacc acagccatcc ttgccacttt ctgtagcggc 60 gcactggcag ccacaaacga cgacgatgtt aaaaaagctg ccactgtggc cattgctgct 120 gcctacaaca atggccaaga aatcaacggt ttcaaagctg gagagaccat ctacgacatt 180 gatgaagacg gcacaattac caaaaaagac gcaactgcag ccgatgttga agccgacgac 240 tttaaaggtc tgggtctgaa aaaagtcgtg actaacctga ccaaaaccgt caatgaaaac 300 aaacaaaacg tcgatgccaa agtaaaagct gcagaatctg aaatagaaaa gttaacaacc 360 aagttagcag acactgatgc cgctttagca gatactgatg ccgctctgga tgcaaccacc 420 aacgccttga ataaattggg agaaaatata acgacatttg ctgaagagac taagacaaat 480 atcgtaaaaa ttgatgaaaa attagaagcc gtggctgata ccgtcgacaa gcatgccgaa 540 gcattcaacg atatcgccga ttcattggat gaaaccaaca ctaaggcaga cgaagccgtc 600 aaaaccgcca atgaagccaa acagacggcc gaagaaacca aacaaaacgt cgatgccaaa 660 gtaaaagctg cagaaactgc agcaggcaaa gccgaagctg ccgctggcac agctaatact 720 gcagccgaca aggccgaagc tgtcgctgca aaagttaccg acatcaaagc tgatatcgct 780 acgaacaaag ataatattgc taaaaaagca aacagtgccg acgtgtacac cagagaagag 840 tctgacagca aatttgtcag aattgatggt ctgaacgcta ctaccgaaaa attggacaca 900 cgcttggctt ctgctgaaaa atccattgcc gatcacgata ctcgcctgaa cggtttggat 960 aaaacagtgt cagacctgcg caaagaaacc cgccaaggcc ttgcagaaca agccgcgctc 
1020 tccggtctgt tccaacctta caacgtgggt ggatccggag ggggtggtgt cgccgccgac 1080 atcggtgcgg ggcttgccga tgcactaacc gcaccgctcg accataaaga caaaggtttg 1140 cagtctttga cgctggatca gtccgtcagg aaaaacgaga aactgaagct ggcggcacaa 1200 ggtgcggaaa aaacttatgg aaacggtgac agcctcaata cgggcaaatt gaagaacgac 1260 aaggtcagcc gtttcgactt tatccgccaa atcgaagtgg acgggcagct cattaccttg 1320 gagagtggag agttccaagt atacaaacaa agccattccg ccttaaccgc ctttcagacc 1380 gagcaaatac aagattcgga gcattccggg aagatggttg cgaaacgcca gttcagaatc 1440 ggcgacatag cgggcgaaca tacatctttt gacaagcttc ccgaaggcgg cagggcgaca 1500 tatcgcggga cggcgttcgg ttcagacgat gccggcggaa aactgaccta caccatagat 1560 ttcgccgcca agcagggaaa cggcaaaatc gaacatttga aatcgccaga actcaatgtc 1620 gacctggccg ccgccgatat caagccggat ggaaaacgcc atgccgtcat cagcggttcc 1680 gtcctttaca accaagccga gaaaggcagt tactccctcg gtatctttgg cggaaaagcc 1740 caggaagttg ccggcagcgc ggaagtgaaa accgtaaacg gcatacgcca tatcggcctt 1800 gccgccaagc aactcgagca ccaccaccac caccactga 1839 56 612 PRT Artificial Sequence 961cL-741 56 Met Lys His Phe Pro Ser Lys Val Leu Thr Thr Ala Ile Leu Ala Thr 1 5 10 15 Phe Cys Ser Gly Ala Leu Ala Ala Thr Asn Asp Asp Asp Val Lys Lys 20 25 30 Ala Ala Thr Val Ala Ile Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile 35 40 45 Asn Gly Phe Lys Ala Gly Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly 50 55 60 Thr Ile Thr Lys Lys Asp Ala Thr Ala Ala Asp Val Glu Ala Asp Asp 65 70 75 80 Phe Lys Gly Leu Gly Leu Lys Lys Val Val Thr Asn Leu Thr Lys Thr 85 90 95 Val Asn Glu Asn Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu 100 105 110 Ser Glu Ile Glu Lys Leu Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala 115 120 125 Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn 130 135 140 Lys Leu Gly Glu Asn Ile Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn 145 150 155 160 Ile Val Lys Ile Asp Glu Lys Leu Glu Ala Val Ala Asp Thr Val Asp 165 170 175 Lys His Ala Glu Ala Phe Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr 180 185 190 Asn Thr Lys Ala Asp Glu Ala Val Lys Thr Ala Asn Glu Ala Lys Gln 195 200 205 
Thr Ala Glu Glu Thr Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala 210 215 220 Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr 225 230 235 240 Ala Ala Asp Lys Ala Glu Ala Val Ala Ala Lys Val Thr Asp Ile Lys 245 250 255 Ala Asp Ile Ala Thr Asn Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser 260 265 270 Ala Asp Val Tyr Thr Arg Glu Glu Ser Asp Ser Lys Phe Val Arg Ile 275 280 285 Asp Gly Leu Asn Ala Thr Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser 290 295 300 Ala Glu Lys Ser Ile Ala Asp His Asp Thr Arg Leu Asn Gly Leu Asp 305 310 315 320 Lys Thr Val Ser Asp Leu Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu 325 330 335 Gln Ala Ala Leu Ser Gly Leu Phe Gln Pro Tyr Asn Val Gly Gly Ser 340 345 350 Gly Gly Gly Gly Val Ala Ala Asp Ile Gly Ala Gly Leu Ala Asp Ala 355 360 365 Leu Thr Ala Pro Leu Asp His Lys Asp Lys Gly Leu Gln Ser Leu Thr 370 375 380 Leu Asp Gln Ser Val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gln 385 390 395 400 Gly Ala Glu Lys Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys 405 410 415 Leu Lys Asn Asp Lys Val Ser Arg Phe Asp Phe Ile Arg Gln Ile Glu 420 425 430 Val Asp Gly Gln Leu Ile Thr Leu Glu Ser Gly Glu Phe Gln Val Tyr 435 440 445 Lys Gln Ser His Ser Ala Leu Thr Ala Phe Gln Thr Glu Gln Ile Gln 450 455 460 Asp Ser Glu His Ser Gly Lys Met Val Ala Lys Arg Gln Phe Arg Ile 465 470 475 480 Gly Asp Ile Ala Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly 485 490 495 Gly Arg Ala Thr Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly 500 505 510 Gly Lys Leu Thr Tyr Thr Ile Asp Phe Ala Ala Lys Gln Gly Asn Gly 515 520 525 Lys Ile Glu His Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala 530 535 540 Ala Asp Ile Lys Pro Asp Gly Lys Arg His Ala Val Ile Ser Gly Ser 545 550 555 560 Val Leu Tyr Asn Gln Ala Glu Lys Gly Ser Tyr Ser Leu Gly Ile Phe 565 570 575 Gly Gly Lys Ala Gln Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val 580 585 590 Asn Gly Ile Arg His Ile Gly Leu Ala Ala Lys Gln Leu Glu His His 595 600 605 His His His His 610 57 4218 DNA Artificial Sequence 961cL-983 57 atgaaacact 
ttccatccaa agtactgacc acagccatcc ttgccacttt ctgtagcggc 60 gcactggcag ccacaaacga cgacgatgtt aaaaaagctg ccactgtggc cattgctgct 120 gcctacaaca atggccaaga aatcaacggt ttcaaagctg gagagaccat ctacgacatt 180 gatgaagacg gcacaattac caaaaaagac gcaactgcag ccgatgttga agccgacgac 240 tttaaaggtc tgggtctgaa aaaagtcgtg actaacctga ccaaaaccgt caatgaaaac 300 aaacaaaacg tcgatgccaa agtaaaagct gcagaatctg aaatagaaaa gttaacaacc 360 aagttagcag acactgatgc cgctttagca gatactgatg ccgctctgga tgcaaccacc 420 aacgccttga ataaattggg agaaaatata acgacatttg ctgaagagac taagacaaat 480 atcgtaaaaa ttgatgaaaa attagaagcc gtggctgata ccgtcgacaa gcatgccgaa 540 gcattcaacg atatcgccga ttcattggat gaaaccaaca ctaaggcaga cgaagccgtc 600 aaaaccgcca atgaagccaa acagacggcc gaagaaacca aacaaaacgt cgatgccaaa 660 gtaaaagctg cagaaactgc agcaggcaaa gccgaagctg ccgctggcac agctaatact 720 gcagccgaca aggccgaagc tgtcgctgca aaagttaccg acatcaaagc tgatatcgct 780 acgaacaaag ataatattgc taaaaaagca aacagtgccg acgtgtacac cagagaagag 840 tctgacagca aatttgtcag aattgatggt ctgaacgcta ctaccgaaaa attggacaca 900 cgcttggctt ctgctgaaaa atccattgcc gatcacgata ctcgcctgaa cggtttggat 960 aaaacagtgt cagacctgcg caaagaaacc cgccaaggcc ttgcagaaca agccgcgctc 1020 tccggtctgt tccaacctta caacgtgggt ggatccggcg gaggcggcac ttctgcgccc 1080 gacttcaatg caggcggtac cggtatcggc agcaacagca gagcaacaac agcgaaatca 1140 gcagcagtat cttacgccgg tatcaagaac gaaatgtgca aagacagaag catgctctgt 1200 gccggtcggg atgacgttgc ggttacagac agggatgcca aaatcaatgc cccccccccg 1260 aatctgcata ccggagactt tccaaaccca aatgacgcat acaagaattt gatcaacctc 1320 aaacctgcaa ttgaagcagg ctatacagga cgcggggtag aggtaggtat cgtcgacaca 1380 ggcgaatccg tcggcagcat atcctttccc gaactgtatg gcagaaaaga acacggctat 1440 aacgaaaatt acaaaaacta tacggcgtat atgcggaagg aagcgcctga agacggaggc 1500 ggtaaagaca ttgaagcttc tttcgacgat gaggccgtta tagagactga agcaaagccg 1560 acggatatcc gccacgtaaa agaaatcgga cacatcgatt tggtctccca tattattggc 1620 gggcgttccg tggacggcag acctgcaggc ggtattgcgc ccgatgcgac gctacacata 1680 atgaatacga atgatgaaac caagaacgaa 
atgatggttg cagccatccg caatgcatgg 1740 gtcaagctgg gcgaacgtgg cgtgcgcatc gtcaataaca gttttggaac aacatcgagg 1800 gcaggcactg ccgacctttt ccaaatagcc aattcggagg agcagtaccg ccaagcgttg 1860 ctcgactatt ccggcggtga taaaacagac gagggtatcc gcctgatgca acagagcgat 1920 tacggcaacc tgtcctacca catccgtaat aaaaacatgc ttttcatctt ttcgacaggc 1980 aatgacgcac aagctcagcc caacacatat gccctattgc cattttatga aaaagacgct 2040 caaaaaggca ttatcacagt cgcaggcgta gaccgcagtg gagaaaagtt caaacgggaa 2100 atgtatggag aaccgggtac agaaccgctt gagtatggct ccaaccattg cggaattact 2160 gccatgtggt gcctgtcggc accctatgaa gcaagcgtcc gtttcacccg tacaaacccg 2220 attcaaattg ccggaacatc cttttccgca cccatcgtaa ccggcacggc ggctctgctg 2280 ctgcagaaat acccgtggat gagcaacgac aacctgcgta ccacgttgct gacgacggct 2340 caggacatcg gtgcagtcgg cgtggacagc aagttcggct ggggactgct ggatgcgggt 2400 aaggccatga acggacccgc gtcctttccg ttcggcgact ttaccgccga tacgaaaggt 2460 acatccgata ttgcctactc cttccgtaac gacatttcag gcacgggcgg cctgatcaaa 2520 aaaggcggca gccaactgca actgcacggc aacaacacct atacgggcaa aaccattatc 2580 gaaggcggtt cgctggtgtt gtacggcaac aacaaatcgg atatgcgcgt cgaaaccaaa 2640 ggtgcgctga tttataacgg ggcggcatcc ggcggcagcc tgaacagcga cggcattgtc 2700 tatctggcag ataccgacca atccggcgca aacgaaaccg tacacatcaa aggcagtctg 2760 cagctggacg gcaaaggtac gctgtacaca cgtttgggca aactgctgaa agtggacggt 2820 acggcgatta tcggcggcaa gctgtacatg tcggcacgcg gcaagggggc aggctatctc 2880 aacagtaccg gacgacgtgt tcccttcctg agtgccgcca aaatcgggca ggattattct 2940 ttcttcacaa acatcgaaac cgacggcggc ctgctggctt ccctcgacag cgtcgaaaaa 3000 acagcgggca gtgaaggcga cacgctgtcc tattatgtcc gtcgcggcaa tgcggcacgg 3060 actgcttcgg cagcggcaca ttccgcgccc gccggtctga aacacgccgt agaacagggc 3120 ggcagcaatc tggaaaacct gatggtcgaa ctggatgcct ccgaatcatc cgcaacaccc 3180 gagacggttg aaactgcggc agccgaccgc acagatatgc cgggcatccg cccctacggc 3240 gcaactttcc gcgcagcggc agccgtacag catgcgaatg ccgccgacgg tgtacgcatc 3300 ttcaacagtc tcgccgctac cgtctatgcc gacagtaccg ccgcccatgc cgatatgcag 3360 ggacgccgcc tgaaagccgt atcggacggg ttggaccaca 
acggcacggg tctgcgcgtc 3420 atcgcgcaaa cccaacagga cggtggaacg tgggaacagg gcggtgttga aggcaaaatg 3480 cgcggcagta cccaaaccgt cggcattgcc gcgaaaaccg gcgaaaatac gacagcagcc 3540 gccacactgg gcatgggacg cagcacatgg agcgaaaaca gtgcaaatgc aaaaaccgac 3600 agcattagtc tgtttgcagg catacggcac gatgcgggcg atatcggcta tctcaaaggc 3660 ctgttctcct acggacgcta caaaaacagc atcagccgca gcaccggtgc ggacgaacat 3720 gcggaaggca gcgtcaacgg cacgctgatg cagctgggcg cactgggcgg tgtcaacgtt 3780 ccgtttgccg caacgggaga tttgacggtc gaaggcggtc tgcgctacga cctgctcaaa 3840 caggatgcat tcgccgaaaa aggcagtgct ttgggctgga gcggcaacag cctcactgaa 3900 ggcacgctgg tcggactcgc gggtctgaag ctgtcgcaac ccttgagcga taaagccgtc 3960 ctgtttgcaa cggcgggcgt ggaacgcgac ctgaacggac gcgactacac ggtaacgggc 4020 ggctttaccg gcgcgactgc agcaaccggc aagacggggg cacgcaatat gccgcacacc 4080 cgtctggttg ccggcctggg cgcggatgtc gaattcggca acggctggaa cggcttggca 4140 cgttacagct acgccggttc caaacagtac ggcaaccaca gcggacgagt cggcgtaggc 4200 taccggttct gactcgag 4218 58 1403 PRT Artificial Sequence 961cL-983 58 Met Lys His Phe Pro Ser Lys Val Leu Thr Thr Ala Ile Leu Ala Thr 1 5 10 15 Phe Cys Ser Gly Ala Leu Ala Ala Thr Asn Asp Asp Asp Val Lys Lys 20 25 30 Ala Ala Thr Val Ala Ile Ala Ala Ala Tyr Asn Asn Gly Gln Glu Ile 35 40 45 Asn Gly Phe Lys Ala Gly Glu Thr Ile Tyr Asp Ile Asp Glu Asp Gly 50 55 60 Thr Ile Thr Lys Lys Asp Ala Thr Ala Ala Asp Val Glu Ala Asp Asp 65 70 75 80 Phe Lys Gly Leu Gly Leu Lys Lys Val Val Thr Asn Leu Thr Lys Thr 85 90 95 Val Asn Glu Asn Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala Glu 100 105 110 Ser Glu Ile Glu Lys Leu Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala 115 120 125 Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn 130 135 140 Lys Leu Gly Glu Asn Ile Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn 145 150 155 160 Ile Val Lys Ile Asp Glu Lys Leu Glu Ala Val Ala Asp Thr Val Asp 165 170 175 Lys His Ala Glu Ala Phe Asn Asp Ile Ala Asp Ser Leu Asp Glu Thr 180 185 190 Asn Thr Lys Ala Asp Glu Ala Val Lys Thr Ala Asn Glu Ala Lys Gln 195 200 205 
Thr Ala Glu Glu Thr Lys Gln Asn Val Asp Ala Lys Val Lys Ala Ala 210 215 220 Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr 225 230 235 240 Ala Ala Asp Lys Ala Glu Ala Val Ala Ala Lys Val Thr Asp Ile Lys 245 250 255 Ala Asp Ile Ala Thr Asn Lys Asp Asn Ile Ala Lys Lys Ala Asn Ser 260 265 270 Ala Asp Val Tyr Thr Arg Glu Glu Ser Asp Ser Lys Phe Val Arg Ile 275 280 285 Asp Gly Leu Asn Ala Thr Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser 290 295 300 Ala Glu Lys Ser Ile Ala Asp His Asp Thr Arg Leu Asn Gly Leu Asp 305 310 315 320 Lys Thr Val Ser Asp Leu Arg Lys Glu Thr Arg Gln Gly Leu Ala Glu 325 330 335 Gln Ala Ala Leu Ser Gly Leu Phe Gln Pro Tyr Asn Val Gly Gly Ser 340 345 350 Gly Gly Gly Gly Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly 355 360 365 Ile Gly Ser Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser 370 375 380 Tyr Ala Gly Ile Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys 385 390 395 400 Ala Gly Arg Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys Ile Asn 405 410 415 Ala Pro Pro Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp 420 425 430 Ala Tyr Lys Asn Leu Ile Asn Leu Lys Pro Ala Ile Glu Ala Gly Tyr 435 440 445 Thr Gly Arg Gly Val Glu Val Gly Ile Val Asp Thr Gly Glu Ser Val 450 455 460 Gly Ser Ile Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr 465 470 475 480 Asn Glu Asn Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro 485 490 495 Glu Asp Gly Gly Gly Lys Asp Ile Glu Ala Ser Phe Asp Asp Glu Ala 500 505 510 Val Ile Glu Thr Glu Ala Lys Pro Thr Asp Ile Arg His Val Lys Glu 515 520 525 Ile Gly His Ile Asp Leu Val Ser His Ile Ile Gly Gly Arg Ser Val 530 535 540 Asp Gly Arg Pro Ala Gly Gly Ile Ala Pro Asp Ala Thr Leu His Ile 545 550 555 560 Met Asn Thr Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala Ile 565 570 575 Arg Asn Ala Trp Val Lys Leu Gly Glu Arg Gly Val Arg Ile Val Asn 580 585 590 Asn Ser Phe Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gln 595 600 605 Ile Ala Asn Ser Glu Glu Gln Tyr Arg Gln Ala Leu Leu Asp Tyr Ser 610 615 620 Gly 
Gly Asp Lys Thr Asp Glu Gly Ile Arg Leu Met Gln Gln Ser Asp 625 630 635 640 Tyr Gly Asn Leu Ser Tyr His Ile Arg Asn Lys Asn Met Leu Phe Ile 645 650 655 Phe Ser Thr Gly Asn Asp Ala Gln Ala Gln Pro Asn Thr Tyr Ala Leu 660 665 670 Leu Pro Phe Tyr Glu Lys Asp Ala Gln Lys Gly Ile Ile Thr Val Ala 675 680 685 Gly Val Asp Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu 690 695 700 Pro Gly Thr Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly Ile Thr 705 710 715 720 Ala Met Trp Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr 725 730 735 Arg Thr Asn Pro Ile Gln Ile Ala Gly Thr Ser Phe Ser Ala Pro Ile 740 745 750 Val Thr Gly Thr Ala Ala Leu Leu Leu Gln Lys Tyr Pro Trp Met Ser 755 760 765 Asn Asp Asn Leu Arg Thr Thr Leu Leu Thr Thr Ala Gln Asp Ile Gly 770 775 780 Ala Val Gly Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly 785 790 795 800 Lys Ala Met Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala 805 810 815 Asp Thr Lys Gly Thr Ser Asp Ile Ala Tyr Ser Phe Arg Asn Asp Ile 820 825 830 Ser Gly Thr Gly Gly Leu Ile Lys Lys Gly Gly Ser Gln Leu Gln Leu 835 840 845 His Gly Asn Asn Thr Tyr Thr Gly Lys Thr Ile Ile Glu Gly Gly Ser 850 855 860 Leu Val Leu Tyr Gly Asn Asn Lys Ser Asp Met Arg Val Glu Thr Lys 865 870 875 880 Gly Ala Leu Ile Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser 885 890 895 Asp Gly Ile Val Tyr Leu Ala Asp Thr Asp Gln Ser Gly Ala Asn Glu 900 905 910 Thr Val His Ile Lys Gly Ser Leu Gln Leu Asp Gly Lys Gly Thr Leu 915 920 925 Tyr Thr Arg Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala Ile Ile 930 935 940 Gly Gly Lys Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu 945 950 955 960 Asn Ser Thr Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys Ile Gly 965 970 975 Gln Asp Tyr Ser Phe Phe Thr Asn Ile Glu Thr Asp Gly Gly Leu Leu 980 985 990 Ala Ser Leu Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr 995 1000 1005 Leu Ser Tyr Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala 1010 1015 1020 Ala Ala His Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gln Gly 1025 1030 1035 
1040 Gly Ser Asn Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser 1045 1050 1055 Ser Ala Thr Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp 1060 1065 1070 Met Pro Gly Ile Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala 1075 1080 1085 Val Gln His Ala Asn Ala Ala Asp Gly Val Arg Ile Phe Asn Ser Leu 1090 1095 1100 Ala Ala Thr Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gln 1105 1110 1115 1120 Gly Arg Arg Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr 1125 1130 1135 Gly Leu Arg Val Ile Ala Gln Thr Gln Gln Asp Gly Gly Thr Trp Glu 1140 1145 1150 Gln Gly Gly Val Glu Gly Lys Met Arg Gly Ser Thr Gln Thr Val Gly 1155 1160 1165 Ile Ala Ala Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly 1170 1175 1180 Met Gly Arg Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp 1185 1190 1195 1200 Ser Ile Ser Leu Phe Ala Gly Ile Arg His Asp Ala Gly Asp Ile Gly 1205 1210 1215 Tyr Leu Lys Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser Ile Ser 1220 1225 1230 Arg Ser Thr Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr 1235 1240 1245 Leu Met Gln Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala 1250 1255 1260 Thr Gly Asp Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys 1265 1270 1275 1280 Gln Asp Ala Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn 1285 1290 1295 Ser Leu Thr Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser 1300 1305 1310 Gln Pro Leu Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu 1315 1320 1325 Arg Asp Leu Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly 1330 1335 1340 Ala Thr Ala Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr 1345 1350 1355 1360 Arg Leu Val Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp 1365 1370 1375 Asn Gly Leu Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gln Tyr Gly Asn 1380 1385 1390 His Ser Gly Arg Val Gly Val Gly Tyr Arg Phe 1395 1400 59 25 DNA Artificial Sequence Oligonucleotide 59 cgcggatccg gagggggtgg tgtcg 25 60 27 DNA Artificial Sequence Oligonucleotide 60 cccgctcgag ttgcttggcg gcaaggc 27 61 25 DNA Artificial 
Sequence Oligonucleotide 61 cgcggatccg gcggaggcgg cactt 25 62 26 DNA Artificial Sequence Oligonucleotide 62 cccgctcgag gaaccggtag cctacg 26 63 41 DNA Artificial Sequence Oligonucleotide 63 cgcggatccg gtggtggtgg ttcagatttg gcaaacgatt c 41 64 29 DNA Artificial Sequence Oligonucleotide 64 cccgctcgag cgtatcatat ttcacgtgc 29 65 25 DNA Artificial Sequence Oligonucleotide 65 cgcggatccg gagggggtgg tgtcg 25 66 28 DNA Artificial Sequence Oligonucleotide 66 cccgctcgag ttattgcttg gcggcaag 28 67 25 DNA Artificial Sequence Oligonucleotide 67 cgcggatccg gcggaggcgg cactt 25 68 28 DNA Artificial Sequence Oligonucleotide 68 cccgctcgag tcagaaccgg tagcctac 28 69 41 DNA Artificial Sequence Oligonucleotide 69 cgcggatccg gtggtggtgg ttcagatttg gcaaacgatt c 41 70 32 DNA Artificial Sequence Oligonucleotide 70 cccgctcgag ttacgtatca tatttcacgt gc 32 71 42 DNA Artificial Sequence Oligonucleotide 71 cgcggatccg gtggtggtgg tcaaagcaag agcatccaaa cc 42 72 30 DNA Artificial Sequence Oligonucleotide 72 cccaagcttt tcgggcggta ttcgggcttc 30 73 39 DNA Artificial Sequence Oligonucleotide 73 cgcggatccg gtggtggtgg tgccacctac aaagtggac 39 74 28 DNA Artificial Sequence Oligonucleotide 74 gcccaagctt ttgtttggct gcctcgat 28 75 34 DNA Artificial Sequence Oligonucleotide 75 cgcggatccg gtggtggtgg tacaagcgac gacg 34 76 28 DNA Artificial Sequence Oligonucleotide 76 gcccaagctt ccactcgtaa ttgacgcc 28 77 41 DNA Artificial Sequence Oligonucleotide 77 cgcggatccg gtggtggtgg ttcagatttg gcaaacgatt c 41 78 28 DNA Artificial Sequence Oligonucleotide 78 cccaagcttc gtatcatatt tcacgtgc 28 79 44 DNA Artificial Sequence Oligonucleotide 79 cccaagcttg gtggtggtgg tggttcagat ttggcaaacg attc 44 80 29 DNA Artificial Sequence Oligonucleotide 80 cccgctcgag cgtatcatat ttcacgtgc 29 81 45 DNA Artificial Sequence Oligonucleotide 81 cccaagcttg gtggtggtgg tggtcaaagc aagagcatcc aaacc 45 82 28 DNA Artificial Sequence Oligonucleotide 82 cccgctcgag cgggcggtat tcgggctt 28 83 32 DNA Artificial Sequence Oligonucleotide 83 cgcggatccg ctagccccga 
tgttaaatcg gc 32 84 29 DNA Artificial Sequence Oligonucleotide 84 cggggatcca tcctgctctt ttttgccgg 29 85 36 DNA Artificial Sequence Oligonucleotide 85 cgcggatccg ctagcggaca cacttatttc ggcatc 36 86 30 DNA Artificial Sequence Oligonucleotide 86 cgcggatccc cagcggtagc ctaatttgat 30 87 41 DNA Artificial Sequence Oligonucleotide 87 cgcggatccg gtggtggtgg ttcagatttg gcaaacgatt c 41 88 28 DNA Artificial Sequence Oligonucleotide 88 cccaagcttc gtatcatatt tcacgtgc 28 89 36 DNA Artificial Sequence Oligonucleotide 89 gcggcgtcga cggtggcgga ggcactggat cctcag 36 90 35 DNA Artificial Sequence Oligonucleotide 90 ggaggcactg gatcctcaga tttggcaaac gattc 35 91 29 DNA Artificial Sequence Oligonucleotide 91 cccgctcgag cgtatcatat ttcacgtgc 29 92 25 DNA Artificial Sequence Oligonucleotide 92 cggggatccg ggggcggcgg tggcg 25 93 30 DNA Artificial Sequence Oligonucleotide 93 cccaagctta tcctgctctt ttttgccggc 30 94 42 DNA Artificial Sequence Oligonucleotide 94 cgcggatccg gtggtggtgg tcaaagcaag agcatccaaa cc 42 95 28 DNA Artificial Sequence Oligonucleotide 95 cccaagcttc gggcggtatt cgggcttc 28 96 26 DNA Artificial Sequence Oligonucleotide 96 ccccaagctt gggggcggcg gtggcg 26 97 31 DNA Artificial Sequence Oligonucleotide 97 cccgctcgag atcctgctct tttttgccgg c 31 98 45 DNA Artificial Sequence Oligonucleotide 98 cccaagcttg gtggtggtgg tggtcaaagc aagagcatcc aaacc 45 99 28 DNA Artificial Sequence Oligonucleotide 99 cccgctcgag cgggcggtat tcgggctt 28 100 35 DNA Artificial Sequence Oligonucleotide 100 ggaggcactg gatccgcagc cacaaacgac gacga 35 101 36 DNA Artificial Sequence Oligonucleotide 101 gcggcctcga gggtggcgga ggcactggat ccgcag 36 102 28 DNA Artificial Sequence Oligonucleotide 102 cccgctcgag acccagcttg taaggttg 28 103 35 DNA Artificial Sequence Oligonucleotide 103 ggaggcactg gatccgcagc cacaaacgac gacga 35 104 36 DNA Artificial Sequence Oligonucleotide 104 gcggcctcga gggtggcgga ggcactggat ccgcag 36 105 28 DNA Artificial Sequence Oligonucleotide 105 cccgctcgag ccactcgtaa ttgacgcc 28 106 38 DNA Artificial 
Sequence Oligonucleotide 106 gcggcctcga gggatccggc ggaggcggca cttctgcg 38 107 26 DNA Artificial Sequence Oligonucleotide 107 cccgctcgag gaaccggtag cctacg 26 108 35 DNA Artificial Sequence Oligonucleotide 108 ggaggcactg gatcctcaga tttggcaaac gattc 35 109 37 DNA Artificial Sequence Oligonucleotide 109 gcggcgtcga cggtggcgga ggcactggat cctcaga 37 110 29 DNA Artificial Sequence Oligonucleotide 110 cccgctcgag cgtatcatat ttcacgtgc 29 111 35 DNA Artificial Sequence Oligonucleotide 111 gcggcctcga gggatccgga gggggtggtg tcgcc 35 112 25 DNA Artificial Sequence Oligonucleotide 112 cccgctcgag ttgcttggcg gcaag 25 113 35 DNA Artificial Sequence Oligonucleotide 113 ggaggcactg gatccgcagc cacaaacgac gacga 35 114 36 DNA Artificial Sequence Oligonucleotide 114 gcggcctcga gggtggcgga ggcactggat ccgcag 36 115 28 DNA Artificial Sequence Oligonucleotide 115 cccgctcgag acccagcttg taaggttg 28 116 35 DNA Artificial Sequence Oligonucleotide 116 ggaggcactg gatccgcagc cacaaacgac gacga 35 117 36 DNA Artificial Sequence Oligonucleotide 117 gcggcctcga gggtggcgga ggcactggat ccgcag 36 118 28 DNA Artificial Sequence Oligonucleotide 118 cccgctcgag ccactcgtaa ttgacgcc 28 119 35 DNA Artificial Sequence Oligonucleotide 119 ggaggcactg gatcctcaga tttggcaaac gattc 35 120 37 DNA Artificial Sequence Oligonucleotide 120 gcggcgtcga cggtggcgga ggcactggat cctcaga 37 121 29 DNA Artificial Sequence Oligonucleotide 121 cccgctcgag cgtatcatat ttcacgtgc 29 
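Many of the oligonucleotides listed above begin with a short 5' clamp followed by what looks like a restriction enzyme recognition site (for example `ggatcc` or `ctcgag`). The listing itself does not name the enzymes, so reading these motifs as cloning sites is an assumption. The sketch below scans a few primers copied from the listing for five well-known recognition sequences. The names `SITES`, `PRIMERS` and `find_sites` are hypothetical and are introduced only for illustration.

```python
# Recognition sequences of five common restriction enzymes.
# Assumption: these are the sites the primers were designed around;
# the sequence listing does not say so explicitly.
SITES = {
    "BamHI": "ggatcc",
    "XhoI": "ctcgag",
    "HindIII": "aagctt",
    "NheI": "gctagc",
    "SalI": "gtcgac",
}

# A few primers copied from the listing, keyed by SEQ ID number
# (the spaces between 10-base groups have been removed).
PRIMERS = {
    59: "cgcggatccggagggggtggtgtcg",
    60: "cccgctcgagttgcttggcggcaaggc",
    72: "cccaagcttttcgggcggtattcgggcttc",
    83: "cgcggatccgctagccccgatgttaaatcggc",
    89: "gcggcgtcgacggtggcggaggcactggatcctcag",
}


def find_sites(primer):
    """Return (enzyme, 0-based offset) pairs for every site found, sorted by offset."""
    seq = primer.replace(" ", "").lower()
    hits = []
    for enzyme, site in SITES.items():
        start = seq.find(site)
        while start != -1:
            hits.append((enzyme, start))
            start = seq.find(site, start + 1)
    return sorted(hits, key=lambda hit: hit[1])


if __name__ == "__main__":
    for seq_id, primer in PRIMERS.items():
        print(seq_id, find_sites(primer))
```

A scan of this kind only reports where a motif occurs. It does not establish which enzyme was actually used for a given construct.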

1. A method for the simultaneous heterologous expression of two or more proteins of the invention, in which said two or more proteins of the invention are fused.
 2. The method of claim 1, in which the two or more proteins are: (a) 919 and 287; (b) 953 and 287; (c) 287 and ORF46.1; (d) ORF1 and ORF46.1; (e) 919 and ORF46.1; (f) ORF46.1, 287 and 919; (g) 919 and 519; and (h) ORF97 and 225.
 3. The method of claim 2, in which 287 is at the C-terminal end of protein (a), (b) or (c).
 4. The method of any preceding claim, in which the expression is in an E. coli host.
 5. A protein expressed by the method of any preceding claim.
 6. A hybrid protein of formula NH₂-A-B-COOH, wherein A and B are different Neisserial proteins.
 7. The protein of claim 6, wherein A and B are each selected from orf1, orf4, orf25, orf40, orf46, orf83, 233, 287, 2921, 564, 687, 741, 907, 919, 953, 961 and 983.
 8. The protein of claim 7, wherein A and B are each selected from ORF46.1, 287, 741, 919, 953, 961 and 983.
 9. The protein of claim 8, wherein at least one of said ORF46.1, 287, 741, 919, 953, 961 and 983 is used in essentially full-length form.
 10. The protein of claim 8 or claim 9, wherein at least one of said ORF46.1, 287, 741, 919, 953, 961 and 983 has a deletion.
 11. The protein of claim 10, wherein A and/or B has a poly-glycine deletion (‘ΔG’).
 12. The protein of claim 11, wherein A and/or B is ΔG-287, ΔGTbp2, ΔG741, or ΔG983.
 13. The protein of claim 10, wherein A and/or B is a truncated protein.
 14. The protein of claim 13, wherein A and/or B is Δ1-287, Δ2-287, Δ3-287 or Δ4-287.
 15. The protein of claim 10, wherein a domain of A and/or B is deleted.
 16. The protein of claim 15, wherein A and/or B is 287B, 287C, 287BC, ORF46₁₋₄₃₃, ORF46₄₃₃₋₆₀₈, ORF46, or 961c.
 17. The protein of claim 6, wherein A and B are: (a) 919 and 287; (b) 953 and 287; (c) 287 and ORF46.1; (d) ORF1 and ORF46.1; (e) 919 and ORF46.1; (f) ORF46.1 and 919; (g) 919 and 519; or (h) ORF97 and 225.
 18. The protein of claim 17, wherein the protein is ΔG287-919, ΔG287-953, ΔG287-961, ΔG983-ORF46.1, ΔG983-741, ΔG983-961, ΔG983-961C, ΔG741-961, ΔG741-961C, ΔG741-983, ΔG741-ORF46.1, ORF46.1-741, ORF46.1-961, ORF46.1-961C, 961-ORF46.1, 961-741, 961-983, 961C-ORF46.1, 961C-741, 961C-983, 961CL-ORF46.1, 961CL-741, or 961CL-983.
 19. The protein of claim 8, wherein A or B is 287.
 20. The protein of claim 19, wherein B is 287.
 21. The protein of claim 19, wherein A is ΔG-287.
 22. The protein of claim 21, wherein B is ORF46, 919, 953 or 961.
 23. The protein of any one of claims 19 to 22, wherein 287 is from strain 2996 or 394/98.
 24. The protein of claim 8, wherein A is 961.
 25. The protein of any one of claims 6 to 24, wherein A and B are from the same strain.
 26. The protein of any one of claims 6 to 24, wherein A and B are joined directly.
 27. The protein of any one of claims 6 to 24, wherein A and B are joined via a linker peptide.
 28. The protein of claim 27, wherein the linker peptide is a poly-glycine linker, with the proviso that B is not a ΔG protein.